JP2006079189A

JP2006079189A - Receipt file creation system, medical chart file creation system and file creation system

Info

Publication number: JP2006079189A
Application number: JP2004259900A
Authority: JP
Inventors: Shinya Kimura; 真也木村
Original assignee: Japan Medical Data Center Co Ltd
Current assignee: Japan Medical Data Center Co Ltd
Priority date: 2004-09-07
Filing date: 2004-09-07
Publication date: 2006-03-23
Anticipated expiration: 2024-09-07
Also published as: JP4955197B2

Abstract

<P>PROBLEM TO BE SOLVED: To associate a plurality of coexisting dialects of the medical industry with words of the standard language. <P>SOLUTION: A receipt file creation system 3 includes a master table storage part 70 for storing one of a plurality of synonyms in a master table as one standard word, and a dictionary table storage part 72 for storing the other synonyms, i.e., dialects, in a dictionary table in association with the one standard word. A matching process part 52 searches the dictionary table for text data. When the text data is found to be stored in the dictionary table, a receipt file creating part 54 creates a receipt file in which standard words for the text data are associated with the text data. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、ファイルを生成する技術に関する。 The present invention relates to a technique for generating a file.

病院などの医療機関では、医師が、傷病名、投薬、注射、検査、手術などの医療行為の内容（以下、「診療内容」という）をカルテに記入する。近年、多くの医療機関にレセプトコンピュータと呼ばれる処理装置（「医事コンピュータ」ともいう）が導入されており、医療機関の担当者は、カルテをもとに診療内容をレセプトコンピュータに入力して、レセプトコンピュータに記憶されたフォーマットで診療報酬明細書（以下、「レセプト」という）を作成する。また病院だけではなく、薬局においてもレセプトコンピュータの普及が進んでいる。医療機関は、レセプトを、診療報酬請求書（以下、「請求書」という）とともに各都道府県単位の社会保険診療報酬支払基金に提出する。支払基金は、投薬、注射、手術などの請求点数に誤りがないかを点検し、審査委員会が、支払基金にて点検された請求書およびレセプトを審査する。このような審査を終えたレセプトに基づいて、診療報酬額が決定される。健康保険組合などの保険者は、審査を経たレセプトを二次審査し、支払基金などを通じて医療機関に診療報酬を支払う一方で、診療内容に疑問のあるものについては審査委員会に対して再審査を請求する。
特許第３１３９４８５号明細書 In a medical institution such as a hospital, a doctor fills in a medical record with the name of a medical condition such as injury and illness, medication, injection, examination, and surgery (hereinafter referred to as “medical treatment content”). In recent years, many medical institutions have introduced a processing device called a “receipt computer” (also referred to as a “medical computer”), and a person in charge of the medical institution inputs the contents of medical treatment into the reception computer based on the medical record. A medical remuneration statement (hereinafter referred to as “receipt”) is created in a format stored in the computer. In addition to hospitals, reception computers are spreading in pharmacies. The medical institution submits the receipt together with a medical fee bill (hereinafter referred to as “bill”) to the social insurance medical fee payment fund of each prefecture. The payment fund checks whether there are any errors in the number of claims for medication, injection, surgery, etc., and the review committee reviews the bills and receipts checked by the payment fund. The amount of medical treatment fee is determined based on the receipt after such examination. Insurers, such as health insurance associations, conduct a secondary review of the reviews that have been reviewed, and pay medical fees to medical institutions through payment funds, etc., while those with doubts about medical treatment are reexamined with the review committee To charge.
Japanese Patent No. 3139485

近年、財政赤字の問題もあり、保険者が二次審査を強化している。医療機関の手違いなどにより誤った請求がなされることもあるため、それを二次審査により見つけて再審査にかけることで、過剰な診療報酬の支払を避けることを目的としている。しかしながら、一方で、レセプトの点検には医学的な専門知識が要求されるため、保険者の誤解に基づいた再審査請求が行われることもある。近頃、規制緩和の一環として、健康保険組合が、レセプトの審査・支払業務を医療機関に対して直接行うことも可能となった。利害が相反する保険者と医療機関とが直接交渉することになるため、いずれかの誤解に基づく無用なトラブルが発生する事態も考えられる。 In recent years, there has been a budget deficit problem, and insurers have strengthened the secondary examination. Incorrect claims may be made due to mistakes in medical institutions, etc., so the aim is to avoid excessive payment of medical fees by finding them through a secondary review and submitting them to a reexamination. However, on the other hand, since medical expertise is required for the inspection of the receipt, a reexamination request may be made based on the misunderstanding of the insurer. Recently, as part of the deregulation, it became possible for the health insurance union to conduct the screening and payment of receipts directly to medical institutions. Since insurers and medical institutions that have conflicting interests will negotiate directly, there may be cases where unnecessary troubles occur due to any misunderstanding.

一般にレセプトは紙ベースで受け渡されることが多い。レセプトは、患者に施された医療行為を表現するものであり、複数月さらには複数年にまたがった患者のレセプトの情報は、その患者の時系列的な傷病履歴を表現する。保険者側で患者個人の傷病履歴をまとめることができれば、その傷病履歴を解析して、個々の患者の健康管理にも役立てることができる。紙ベースのレセプトをテキスト化することで、患者の傷病履歴をデータとして効率的に管理できるとともに、患者に継続して施された診療行為が適切なものであるかをチェックすることも可能となる。 In general, receipts are often delivered on a paper basis. The receipt expresses a medical practice performed on the patient, and the information on the patient's receipt over a plurality of months or even a plurality of years expresses the chronological injury history of the patient. If the insurer can summarize the injury history of individual patients, the injury history can be analyzed and used for health management of individual patients. By making paper-based receipts into text, it is possible to efficiently manage the patient's injury and illness history as data, and also to check whether the patient's ongoing medical practice is appropriate .

我が国は、統計法に基づく統計調査や、医学的分類として医療機関における診療録の管理等に利用することを目的として、ＩＣＤ−１０に準拠した「疾病、傷害及び死因分類」を作成している。「疾病、傷害及び死因分類」に示される用語は、傷病名について我が国の医療業界で統一的に用いられることが好ましく、全てのカルテないしはレセプトが、「疾病、傷害及び死因分類」の用語を用いて作成されることで、本来意図する統計調査や、診療録の管理等を効率的に行うことが可能となる。 Japan has created the “Disease, Injury and Death Cause Classification” in accordance with ICD-10 for the purpose of statistical surveys based on statistical methods and management of medical records as medical classifications at medical institutions. . The terms shown in the “Disease, Injury and Death Cause Classification” are preferably used in the medical industry in Japan for the names of wounds and diseases, and all medical records or receipts use the term “Disease, Injury and Death Cause Classification”. Thus, it is possible to efficiently perform originally intended statistical surveys, medical record management, and the like.

しかしながら、医療業界では慣例的に、１つの傷病や医薬品などが、複数の呼び名で表現されることがある。その他、検査名や医薬品の投与量単位の表現などについても同様である。傷病名に関して、上記した「疾病、傷害及び死因分類」に基づく用語を「標準語」と呼び、同意の他の用語を「方言」と呼ぶ場合、方言の使用は何も特別なことではなく、医師によっては、使用している用語が方言であることの意識すらないこともある。現在、方言を統一化して、標準語を使用していこうという業界の流れはあるが、その意識が全ての医師に浸透するはずはなく、医師ごと、またはレセプトコンピュータごとに、使い慣れた方言を使用し続けている現状がある。また、医療機関の種類によっても方言を使用する事情が存在し、例えば小規模な診療所や専門病院などでは、無理に標準語を使用することが管理業務を非効率にすることもある。 However, in the medical industry, it is customary that a single injury or illness or medicine is expressed by a plurality of names. The same applies to the name of the test and the dosage unit of the drug. Regarding the name of injury and illness, when the term based on the above `` disease, injury and death classification '' is called `` standard language '' and other terms of consent are called `` dialect '', the use of the dialect is nothing special, Some doctors may not be aware that the terminology used is a dialect. Currently, there is an industry trend to standardize dialects and use standard words, but that consciousness should not permeate all doctors, and use familiar dialects for each doctor or for each reception computer. There is a current situation that continues. In addition, there are circumstances in which dialects are used depending on the type of medical institution. For example, in a small clinic or specialized hospital, using a standard language forcibly may make management work inefficient.

一方で、レセプトを審査する立場からすると、用語が統一されていなければ、傷病名や医薬品名などを容易に理解できず、効率的な審査が阻害される。そのため、紙レセプトの内容を単にテキスト化しただけでは、全ての方言を理解する技量をもつ人間しかレセプトのチェックはできないことになる。また、患者が複数の医療機関にかかった場合に、医療機関ごとに用語が統一されていなければ、レセプトのテキストデータから患者個人の健康管理に役立つ情報を抽出することも容易でなく、さらには全ての患者のデータを用いた統計的な処理を行うこともできない。
そこで本発明は、方言として存在する同義語に標準語を対応付ける技術を提供し、利便性の高いデータファイルを生成することを目的とする。 On the other hand, from the standpoint of reviewing the receipt, if the terms are not unified, the names of wounds and medicines cannot be easily understood, and efficient screening is hindered. For this reason, only the person who has the skill to understand all dialects can check the receipt by simply converting the contents of the paper receipt into text. In addition, when a patient visits multiple medical institutions, if the terminology is not uniform for each medical institution, it is not easy to extract information useful for individual patient health management from the text data of the receipt, It is not possible to perform statistical processing using all patient data.
Therefore, an object of the present invention is to provide a technique for associating a standard word with a synonym existing as a dialect and to generate a highly convenient data file.

上記課題を解決するために、本発明のある態様は、レセプトファイルを生成するシステムに関する。この態様のレセプトファイル生成システムは、標準語となる医療関係用語と、その識別コードおよび属性情報とを対応付けたマスタテーブルを格納するマスタテーブル格納部と、医療関係用語として標準語と同じ意味を表す同義語を、該標準語の識別コードに対応付けた辞書テーブルを格納する辞書テーブル格納部と、レセプトをテキスト化したデータファイルに含まれるテキストデータを抽出するテキストデータ処理部と、抽出されたテキストデータを、辞書テーブルにおいて検索するマッチング処理部と、マッチング処理部により、辞書テーブルにテキストデータが医療関係用語として記憶されていることが判明した場合に、識別コードをもとに、該テキストデータをマスタテーブルの標準語に対応付けたレセプトファイルを生成するレセプトファイル生成部とを備える。医療関係用語とは、医療に関する用語（名前）であり、医療行為を表現する用語だけでなく、医薬品名なども含み、特にレセプトの表記に使用される用語を意味する。具体的に、医療関係用語は、傷病名、医薬品名、医薬品の投与量単位、医療材料名、診療行為名などの用語を含む。標準語と同じ意味を表す同義語は、標準語以外の同義語（方言）を含む。同義語には、標準語自身が含まれてもよく、この場合には辞書テーブルに標準語も含まれることになるので、マッチング処理部は、辞書テーブルを検索することで、標準語を探し出すことができる。なお、辞書テーブルに標準語が含まれない場合は、マッチング処理部がマスタテーブルを検索することで、標準語を探し出すことができる。 In order to solve the above problems, an aspect of the present invention relates to a system for generating a receipt file. The receipt file generation system of this aspect includes a master table storage unit that stores a master table that associates medical related terms that are standard words with their identification codes and attribute information, and has the same meaning as standard terms as medical related terms. A dictionary table storage unit that stores a dictionary table in which synonyms to be represented are associated with identification codes of the standard words, a text data processing unit that extracts text data included in a data file in which a receipt is converted into text, and When the text data is searched for in the dictionary table and the matching processing unit finds that the text data is stored as a medical term in the dictionary table, the text data is based on the identification code. That generates a receipt file that associates a master table with a standard word And a script file generation unit. The medical-related term is a term (name) related to medical treatment, and includes not only a term representing medical practice but also a drug name and the like, and particularly means a term used for notation of a receipt. Specifically, medical-related terms include terms such as injury and illness name, drug name, drug dosage unit, medical material name, and medical practice name. Synonyms representing the same meaning as standard words include synonyms (dialects) other than standard words. The synonym may include the standard word itself. In this case, the standard word is also included in the dictionary table, so the matching processing unit searches for the standard word by searching the dictionary table. Can do. When the standard word is not included in the dictionary table, the matching processor can search for the standard word by searching the master table.

このレセプトファイル生成システムによると、標準語以外の同義語を、標準語に対応付けたレセプトファイルを生成することができる。このシステムにより生成されたレセプトファイルを利用すると、標準語をキーとして、統計処理や個人の健康管理などの様々なデータ処理を行うことが可能となる。 According to this receipt file generation system, it is possible to generate a receipt file in which synonyms other than standard words are associated with standard words. By using the receipt file generated by this system, various data processing such as statistical processing and personal health management can be performed using the standard word as a key.

マッチング処理部は、辞書テーブルにテキストデータが医療関係用語として記憶されていない場合に、そのテキストデータを、不明データとして所定の格納領域に記憶させる。一旦、不明データとして記憶させることで、そのデータを、後にマスタテーブルや辞書テーブルの拡張に利用することが可能となる。 When the text data is not stored as medical terms in the dictionary table, the matching processing unit stores the text data as unknown data in a predetermined storage area. Once stored as unknown data, the data can be used later for expansion of the master table or dictionary table.

このレセプトファイル生成システムは、辞書テーブルに記憶されておらず、且つ通常使用されない文字列を、標準語の識別コードに対応付けた一時辞書テーブルを格納する一時辞書テーブル格納部をさらに備え、レセプトファイル生成部は、一時辞書テーブルをもとに、文字列のテキストデータを、マスタテーブルの標準語に対応付けたレセプトファイルを生成する。辞書テーブルに記憶されておらず、且つ通常使用されない文字列としては、例えば、誤入力された表現や、特定の医療機関のみで用いられる隠語的表現など、一般には用いられない異例的な表現が該当する。そのような文字列であっても、文字列自体を保存しながら、マスタテーブルに記憶されている標準語に対応付けたレセプトファイルを生成することで、もとのレセプトの内容を維持しながら、有用性の高いレセプトファイルを生成することが可能となる。 The receipt file generation system further includes a temporary dictionary table storage unit that stores a temporary dictionary table in which character strings that are not stored in the dictionary table and are not normally used are associated with identification codes of standard words. The generation unit generates a receipt file in which character string text data is associated with a standard word in the master table based on the temporary dictionary table. Examples of character strings that are not stored in the dictionary table and are not normally used include unusual expressions that are not generally used, such as expressions that are entered incorrectly or slang expressions that are used only by specific medical institutions. Applicable. Even if it is such a character string, while maintaining the content of the original receipt by generating a receipt file associated with the standard word stored in the master table while saving the character string itself, A highly useful receipt file can be generated.

レセプトファイル生成部は、辞書テーブルに記憶されておらず、且つ通常使用されない文字列のテキストデータについて一時辞書テーブルに記憶した対応関係を所定の期間に限って利用して、レセプトファイルを生成してもよい。所定の期間に限って利用させることで、その文字列と同じ医療関係用語が後に登場した場合であっても、レセプトファイルの有用性を維持することができる。 The receipt file generation unit generates a receipt file by using the correspondence stored in the temporary dictionary table for text data of character strings that are not stored in the dictionary table and are not normally used for a predetermined period. Also good. By using it only for a predetermined period, it is possible to maintain the usefulness of the receipt file even when the same medical terms as the character string appear later.

マスタテーブル格納部および辞書テーブル格納部のそれぞれは、マスタテーブルおよび辞書テーブルのそれぞれを、医療関係用語の種類ごとに格納する。例えば、傷病名、医薬品名など、医療関係用語の種類ごとにマスタテーブルおよび辞書テーブルを作成することで、それぞれのテーブルの更新、管理、保守などを容易に実行できる。 Each of the master table storage unit and the dictionary table storage unit stores the master table and the dictionary table for each type of medical related term. For example, by creating a master table and a dictionary table for each type of medical-related terms such as names of wounds and medicines, it is possible to easily update, manage, and maintain each table.

このレセプトファイル生成システムは、文字列を、参照する辞書テーブルの種類に対応付けた振分テーブルを格納する振分テーブル格納部をさらに備え、マッチング処理部は、振分テーブルを参照して、抽出されたテキストデータに対して参照する辞書テーブルを特定してもよい。マッチング処理部は、振分テーブルを参照することで、辞書テーブルの特定に要する時間を短縮できる。 The receipt file generation system further includes a distribution table storage unit that stores a distribution table in which a character string is associated with a type of dictionary table to be referred to, and the matching processing unit extracts the character string by referring to the distribution table. A dictionary table to be referenced with respect to the text data thus set may be specified. The matching processing unit can shorten the time required to specify the dictionary table by referring to the distribution table.

このレセプトファイル生成システムは、文字列と、その文字列を区切って分解した複数の医療関係用語に対応付けた分解テーブルを格納する分解テーブル格納部をさらに備え、テキストデータ処理部は分解テーブルを参照して、文字列のテキストデータを医療関係用語ごとに分解したテキストデータを抽出してもよい。分解テーブルを利用することで、テキストデータ処理部は、文字列から効率的に医療関係用語を抽出することが可能となる。 The receipt file generation system further includes a decomposition table storage unit that stores a character string and a decomposition table associated with a plurality of medical terms that are decomposed by dividing the character string, and the text data processing unit refers to the decomposition table. Then, text data obtained by decomposing the text data of the character string for each medical term may be extracted. By using the disassembly table, the text data processing unit can efficiently extract medical terms from a character string.

このレセプトファイル生成システムは、複数の文字列を結合して生成される医療関係用語をリスト化した結合テーブルを格納する結合テーブル格納部をさらに備え、テキストデータ処理部は結合テーブルを参照して、複数の文字列のテキストデータを結合したテキストデータを抽出してもよい。結合テーブルを利用することで、例えば、１つの医療関係用語において間に空白が挿入されているものや、１つの医療関係用語が改行されて複数段にまたがって記載されているものを、１つの医療関係用語として効率的に抽出することが可能となる。 The receipt file generation system further includes a combined table storage unit that stores a combined table that lists medical related terms generated by combining a plurality of character strings, and the text data processing unit refers to the combined table, Text data obtained by combining text data of a plurality of character strings may be extracted. By using a combination table, for example, one medical related term with a blank inserted in between, or one medical related term inserted into a new line and described across multiple levels It can be efficiently extracted as medical terms.

マッチング処理部は、抽出されたテキストデータと、マスタテーブルまたは辞書テーブルに記憶された医療関係用語との一致性を判断し、一致しない部分が数量表現である場合に、その数量表現を、マスタテーブルまたは辞書テーブルに記憶された医療関係用語に含まれる数量表現と対応付けるべきものと判断してもよい。辞書テーブルにテキストデータが示す表現が存在しない場合であっても、マッチング処理部が自律的にテキストデータと登録済の医療関係用語との紐付けを行うことで、人的処理の負担を低減することができる。 The matching processing unit determines the matching between the extracted text data and the medical-related terms stored in the master table or the dictionary table, and when the unmatched portion is a quantity expression, the quantity expression is Alternatively, it may be determined that it should be associated with the quantity expression included in the medical terms stored in the dictionary table. Even if there is no expression that the text data shows in the dictionary table, the matching processing unit autonomously links the text data with the registered medical terms to reduce the burden of human processing be able to.

本発明の別の態様は、カルテファイルを生成するシステムに関する。この態様のカルテファイル生成システムは、標準語となる医療関係用語と、その識別コードおよび属性情報とを対応付けたマスタテーブルを格納するマスタテーブル格納部と、医療関係用語として標準語と同じ意味を表す同義語を、該標準語の識別コードに対応付けた辞書テーブルを格納する辞書テーブル格納部と、電子カルテのデータファイルに含まれるテキストデータを抽出するテキストデータ処理部と、抽出されたテキストデータを、辞書テーブルにおいて検索するマッチング処理部と、マッチング処理部により、辞書テーブルにテキストデータが医療関係用語として記憶されていることが判明した場合に、識別コードをもとに、該テキストデータをマスタテーブルの標準語に対応付けたカルテファイルを生成するファイル生成部とを備える。医療関係用語とは、医療に関する用語（名前）であり、医療行為を表現する用語だけでなく、医薬品名なども含み、ここでは特にカルテの表記に使用される用語を意味する。具体的に、医療関係用語は、傷病名、医薬品名、医薬品の投与量単位、医療材料名、診療行為名などの用語を含む。標準語と同じ意味を表す同義語は、標準語以外の同義語（方言）を含む。同義語には、標準語自身が含まれてもよく、この場合には辞書テーブルに標準語も含まれることになるので、マッチング処理部は、辞書テーブルを検索することで、標準語を探し出すことができる。なお、辞書テーブルに標準語が含まれない場合は、マッチング処理部がマスタテーブルを検索することで、標準語を探し出すことができる。 Another aspect of the present invention relates to a system for generating a medical record file. The medical record file generation system according to this aspect includes a master table storage unit that stores a master table that associates medical related terms that are standard words with their identification codes and attribute information, and has the same meaning as standard words as medical related terms. A dictionary table storage unit that stores a dictionary table in which synonyms to represent are associated with identification codes of the standard words, a text data processing unit that extracts text data included in a data file of the electronic medical record, and extracted text data If the matching processing unit that searches the dictionary table and the matching processing unit find that the text data is stored as medical terms in the dictionary table, the text data is mastered based on the identification code. A file generator that generates a chart file associated with the standard words of the table Obtain. The medical-related term is a term (name) related to medical treatment, and includes not only a term representing medical practice but also a pharmaceutical name, and means a term particularly used for notation of medical records. Specifically, medical-related terms include terms such as injury and illness name, drug name, drug dosage unit, medical material name, and medical practice name. Synonyms representing the same meaning as standard words include synonyms (dialects) other than standard words. The synonym may include the standard word itself. In this case, the standard word is also included in the dictionary table, so the matching processing unit searches for the standard word by searching the dictionary table. Can do. When the standard word is not included in the dictionary table, the matching processor can search for the standard word by searching the master table.

このカルテファイル生成システムによると、標準語以外の同義語を、標準語に対応付けたカルテファイルを生成することができる。標準語に対応づけたカルテファイルを生成することで、複数の病院のカルテにおける医療関係用語を実質的に統一化することが可能となる。 According to this chart file generation system, a chart file in which synonyms other than standard words are associated with standard words can be generated. By generating a medical record file associated with a standard word, it is possible to substantially unify medical related terms in medical records of a plurality of hospitals.

本発明のさらに別の態様は、標準語となる用語と、その識別コードおよび属性情報とを対応付けたマスタテーブルを格納するマスタテーブル格納部と、標準語と、標準語と同じ意味を表す同義語とを、該標準語の識別コードに対応付けた辞書テーブルを格納する辞書テーブル格納部と、データファイルに含まれるテキストデータを抽出するテキストデータ処理部と、抽出されたテキストデータを、辞書テーブルにおいて検索するマッチング処理部と、マッチング処理部により、辞書テーブルにテキストデータが記憶されていることが判明した場合に、識別コードをもとに、該テキストデータをマスタテーブルの標準語に対応付けたデータファイルを生成するファイル生成部とを備える。 Still another aspect of the present invention provides a master table storage unit that stores a master table that associates a term that becomes a standard word with its identification code and attribute information, a standard word, and a synonym that represents the same meaning as the standard word. A dictionary table storage unit for storing a dictionary table in which words are associated with identification codes of the standard words, a text data processing unit for extracting text data included in the data file, and the extracted text data in the dictionary table When the matching processing unit that searches for and the matching processing unit finds that the text data is stored in the dictionary table, the text data is associated with the standard word of the master table based on the identification code. A file generation unit for generating a data file.

このファイル生成システムによると、標準語以外の同義語（方言）を、標準語に対応付けたファイルを生成することができ、したがって、標準語をキーとして、ファイルを用いた様々なデータ処理を行うことが可能となる。 According to this file generation system, a file in which a synonym (dialect) other than a standard word is associated with a standard word can be generated. Therefore, various data processing using the file is performed using the standard word as a key. It becomes possible.

なお、以上の構成要素の任意の組合せ、本発明の表現を方法、装置、システム、記録媒体、コンピュータプログラムなどの間で変換したものもまた、本発明の態様として有効である。 It should be noted that any combination of the above-described constituent elements and a conversion of the expression of the present invention between a method, an apparatus, a system, a recording medium, a computer program, etc. are also effective as an aspect of the present invention.

本発明によると、標準語と標準語以外の同義語とを対応付けたファイルを生成することができる。 According to the present invention, a file in which a standard word and a synonym other than the standard word are associated can be generated.

図１は、本発明の実施例におけるレセプト処理フローを示す。このレセプト処理フローは、１つの主体により実行されてもよく、複数の主体により協働して実行されてもよい。通常は、複数の主体で明確な役割分担を行い、全体として１つのレセプト処理システムを実現するケースが多いと考えられる。図１に示すレセプト処理フローでは、医療機関または支払基金などから提供される紙レセプトをデータ化し、統計処理などのデータ加工に適したレセプトファイルを効率的に生成する手順を表現している。以下に示す各ステップは、人手が介在することもあるが、多くはシステムによるコンピュータ処理により実行される。 FIG. 1 shows a reception processing flow in an embodiment of the present invention. This receipt process flow may be executed by one main body or may be executed in cooperation by a plurality of main bodies. Usually, it is considered that there are many cases in which a single role processing system is realized as a whole by performing a clear division of roles among a plurality of subjects. In the receipt processing flow shown in FIG. 1, a paper receipt provided from a medical institution or a payment fund is converted into data, and a procedure for efficiently generating a receipt file suitable for data processing such as statistical processing is expressed. Each step shown below may involve human intervention, but most are executed by computer processing by the system.

本実施例のレセプト処理フローは、仕分け作業（Ｓ１０）、紙レセプトのイメージ化（Ｓ１２）、ＯＣＲ処理（Ｓ１４）、パンチ処理（Ｓ１６）、論理チェック処理（Ｓ１８）、対応チェック処理（Ｓ２０）、テキストデータ分解処理（Ｓ２２）、テキストデータ結合処理（Ｓ２３）、分解テーブル、結合テーブルの更新（Ｓ２４）、辞書テーブルの読出し（Ｓ２６）、マッチング処理（Ｓ２８）、マスタテーブル、辞書テーブルの更新（Ｓ３０）、一時辞書テーブル生成（Ｓ３２）、レセプトファイル生成（Ｓ３４）、データ加工（Ｓ３６）の処理ステップに分けることができる。 The receipt processing flow of the present embodiment is as follows: sorting operation (S10), paper receipt imaging (S12), OCR processing (S14), punching processing (S16), logic check processing (S18), correspondence check processing (S20), Text data decomposition process (S22), text data combination process (S23), decomposition table, update of combined table (S24), reading of dictionary table (S26), matching process (S28), update of master table and dictionary table (S30) ), Temporary dictionary table generation (S32), receipt file generation (S34), and data processing (S36).

（１）仕分け作業
仕分け作業（Ｓ１０）では、入院レセプト、入院外レセプト、調剤レセプトなどの紙媒体に印刷された各帳票を仕分けする。この仕分け作業は、一般には人手により行われるが、後続のＳ１２のイメージ化の際に、コンピュータ処理により自動的に実行されてもよい。 (1) Sorting work In the sorting work (S10), each form printed on a paper medium such as an in-hospital receipt, an out-of-hospital receipt, or a dispensing receipt is sorted. This sorting operation is generally performed manually, but may be automatically executed by computer processing at the time of subsequent imaging in S12.

（２）紙レセプトのイメージ化
紙レセプトのイメージ化（Ｓ１２）では、仕分けされた紙レセプトをスキャナにより読み込んで、イメージデータに変換する。ここでは、スキャナに紙レセプトを連続入力しながら、入力中の画像をディスプレイにリアルタイムに表示し、オペレータがスキャニング状況を確認する。紙レセプトが裏面で入力されたような場合には、画像認識処理により自動的にスキャニングを停止し、オペレータにその旨を通知する。また、用紙方向が上下逆のような場合にも、オペレータにその旨を通知する。以上により、向きの揃った紙レセプトのイメージデータを生成する。なお、スキャナのＣＰＵに、画像認識機能だけでなく、レセプト仕分け機能を追加することで、紙レセプトのイメージ化を行いながら、同時に仕分け処理を行って、イメージデータを、レセプトの種類ごとのフォルダに格納していく。 (2) Imaging of paper receipt In imaging of paper receipt (S12), the sorted paper receipt is read by a scanner and converted into image data. Here, while continuously inputting paper receipts into the scanner, the image being input is displayed in real time on the display, and the operator confirms the scanning status. When a paper receipt is input on the back side, scanning is automatically stopped by the image recognition process, and the operator is notified accordingly. Also, when the paper direction is upside down, the operator is notified of this. As described above, the image data of the paper receipt having the same orientation is generated. By adding not only the image recognition function but also the receipt sorting function to the scanner CPU, the paper receipt is imaged and the sorting process is performed at the same time, and the image data is stored in a folder for each type of receipt. Store it.

（３）ＯＣＲ処理
ＯＣＲ処理（Ｓ１４）では、レセプトのイメージデータから光学的文字認識により文字を読み取る。レセプトの上部には、被保険者記号番号や患者の氏名などの個人情報が記載されており、その下方には、傷病名や、投薬、注射などの診療情報、使用した医薬品とその使用量を示す診療明細が記載されている。本明細書では、レセプトの個人情報以外の情報を薬歴情報と呼ぶ。レセプトの薬歴情報は、レセプトの個人情報により、特定の個人と結び付けられることによって秘密に保護されるべきものであり、その取扱いには十分な注意が必要となる。そのため、ＯＣＲ処理の前段階として、まず、個人情報画像と薬歴画像とを切り離す。 (3) OCR process In the OCR process (S14), characters are read from the image data of the receipt by optical character recognition. Personal information such as the insured symbol number and patient's name is listed at the top of the receipt, and below it is the name of the wound, medical information such as medications and injections, the drugs used and their usage. The medical details to be shown are described. In the present specification, information other than personal information of the receipt is referred to as drug history information. The medical history information of a receipt should be protected secretly by being associated with a specific individual by the personal information of the receipt, and handling thereof requires sufficient care. Therefore, as a pre-stage of OCR processing, first, the personal information image and the medicine history image are separated.

最初にレセプトの個人情報画像をＯＣＲ処理によりテキスト化し、暗号技術によりユニークコードに変換する。続いて、このユニークコードを、レセプトの薬歴画像の画像データに結合する。これにより、オペレータは、この結合されたデータをみても個人を特定することができず、一方で、暗号化した個人情報と診療明細情報とを紐付けできる。次に、薬歴画像をＯＣＲ処理によりテキスト化する。テキスト化されたデータは、所定の形式で項目（フィールド）に分類されてファイル化される。このＯＣＲ処理では、高い文字認識率を実現することが好ましい。 First, the personal information image of the receipt is converted into text by OCR processing and converted into a unique code by encryption technology. Subsequently, this unique code is combined with the image data of the medical history image of the receipt. As a result, the operator cannot identify an individual even when viewing the combined data, and can associate the encrypted personal information with the medical treatment details information. Next, the medicine history image is converted into text by OCR processing. The text data is classified into items (fields) in a predetermined format and is filed. In this OCR process, it is preferable to realize a high character recognition rate.

（４）パンチ処理
パンチ処理（Ｓ１６）では、ＯＣＲ処理でテキスト化できなかった箇所または誤ったテキスト化がなされた箇所を、パンチャが入力または修正する。ＯＣＲ処理の精度が高くなるほどパンチャの作業量は減ることになり、したがって、紙レセプトのデータ化にかかるトータルコストを抑えることが可能となる。 (4) Punching process In the punching process (S16), the puncher inputs or corrects a part that cannot be converted to text by the OCR process or a part that has been erroneously converted to text. As the accuracy of the OCR process increases, the work amount of the puncher decreases, so that it is possible to reduce the total cost for converting the paper receipt into data.

Ｓ１０〜Ｓ１６のステップは、テキストデータ生成システムにより実行される。紙レセプトのテキストデータはファイルとしてまとめられ、このデータファイルが、Ｓ１８以降の処理を実行するレセプトファイル生成システムに引き渡される。 Steps S10 to S16 are executed by the text data generation system. The text data of the paper receipt is collected as a file, and this data file is delivered to a receipt file generation system that executes the processing after S18.

（５）論理チェック処理
論理チェック処理（Ｓ１８）では、テキスト化されたデータの論理チェックを行う。コンピュータが論理チェックを実行し、レセプト中の論理的なエラーを検出する。論理的なエラーとは、例えば患者の誕生日が未来の日付になっているような誤りである。 (5) Logic Check Process In the logic check process (S18), the logic check of the text data is performed. The computer performs a logic check and detects a logical error during reception. A logical error is an error in which the patient's birthday is a future date, for example.

（６）対応チェック処理
対応チェック処理（Ｓ２０）では、データファイルにおけるテキストデータ間の対応関係をチェックする。ここでは、例えば、診療開始日と２回目以降の診療日との前後関係が逆であったり、薬歴の摘要欄に医薬品名が存在するものの、使用量が存在しなかったりという誤りをチェックする。また、数字が入力されるべき項目に、文字が入力されているような誤りもチェックする。コンピュータは、データ間の対応関係を予め保持しておき、この対応関係の適合の可否を判断することで、対応関係のエラーを検出する。検出されたエラーは、オペレータに通知される。オペレータはエラー内容を見て、正しい内容に修正する。コンピュータは、データ間の対応関係だけでなく、レセプト間の対応関係もチェックしてよい。例えば、調剤レセプトがあるのに、医科レセプトが存在しない場合、コンピュータは、その旨をオペレータに通知する。 (6) Correspondence Check Processing In the correspondence check processing (S20), the correspondence between text data in the data file is checked. Here, for example, check for errors such as the reverse relationship between the medical treatment start date and the second and subsequent medical treatment dates, or the fact that the drug name is present in the summary column of the drug history but the usage amount does not exist . Also, check for errors such as characters being entered in the item where the number should be entered. The computer retains the correspondence relationship between the data in advance, and determines whether or not the correspondence relationship can be matched, thereby detecting a correspondence relationship error. The detected error is notified to the operator. The operator looks at the error content and corrects it to the correct content. The computer may check not only the correspondence between the data but also the correspondence between the receipts. For example, if there is a dispensing receipt but no medical receipt, the computer notifies the operator to that effect.

（７）テキストデータ分解処理
テキストデータ分解処理（Ｓ２２）では、テキスト化された文字列を分類して区分けする。例えば、医科レセプトの摘要欄データを、医薬品、医療材料、診療行為に分類し、さらに医薬品、医療材料、診療行為の複数項目が１行のテキストデータとして存在している場合に、それらを項目ごとに分解する。具体的には、文字列に含まれるカンマや空白（ブランク）、改行などを検出して、文字列を医療関係用語ごとに区分けしていく。また、連続した文字列と、その文字列を複数の医療関係用語に対応付けた分解テーブルを参照して、文字列を医療関係用語に分解してもよい。例えば、傷病名と医薬品名が連続した文字列としてテキスト化されている場合に、分解テーブルは、その文字列を、傷病名と医薬品名とに対応付けて記憶している。 (7) Text Data Decomposition Processing In the text data decomposition processing (S22), text strings converted into text are classified and classified. For example, if the medical receipt summary column data is classified into pharmaceuticals, medical materials, and medical practices, and multiple items of pharmaceuticals, medical materials, and medical practices exist as a single line of text data, they are classified by item. Disassembled into Specifically, commas, blanks (blanks), line breaks, and the like included in the character string are detected, and the character string is divided into medical terms. Further, the character string may be decomposed into medical terms by referring to a continuous character string and a decomposition table in which the character strings are associated with a plurality of medical terms. For example, when the injury and illness name and the medicine name are converted into text as a continuous character string, the decomposition table stores the character string in association with the injury and illness name and the medicine name.

（８）テキストデータ結合処理
テキストデータ結合処理（Ｓ２３）では、テキストデータ分解処理（Ｓ２２）において空白や改行などにより分解された文字列のうち、医療関係用語として抽出されなかった文字列同士を結合して、１つの医療関係用語を抽出する。このとき、複数の文字列を結合して生成される医療関係用語をリスト化した結合テーブルを参照する。例えば、１つの医薬品名が、間に空白を入れてテキスト化されている場合に、結合テーブルは、その医薬品名を保持して記憶しており、分解処理された複数の文字列を結合することで結合テーブルに保持した医薬品名と一致した場合には、その複数の文字列を結合して、１つの医薬品名を抽出する。 (8) Text data combining process In the text data combining process (S23), character strings that have not been extracted as medical terms are combined among the character strings decomposed by blanks or line breaks in the text data decomposition process (S22). Then, one medical related term is extracted. At this time, a combined table in which medical related terms generated by combining a plurality of character strings are listed is referred to. For example, if a single drug name is made into text with a space in between, the combination table holds the drug name and stores it, and combines a plurality of decomposed character strings. If the drug names stored in the combination table match, the plurality of character strings are combined to extract one drug name.

（９）分解テーブル、結合テーブルの更新
分解テーブル、結合テーブルの更新処理（Ｓ２４）では、Ｓ２２のテキストデータ分解処理において分解できなかったテキストデータを、医療関係用語ごとに区分けして、分解テーブルの拡張を行い、また、Ｓ２３のテキストデータ結合処理において結合できなかった複数のテキストデータを１つの医療関係用語として結合して、結合テーブルの拡張を行う。この作業は人手によって行われる。 (9) Update of disassembly table and join table In the update process of disassembly table and join table (S24), the text data that could not be decomposed in the text data decomposition process of S22 is divided into medical terms, Expansion is performed, and a plurality of text data that could not be combined in the text data combination processing of S23 is combined as one medical related term to expand the combination table. This work is performed manually.

オペレータは、Ｓ２２において区分け不能な文字列を複数の医療関係用語に分解して、分解テーブルの登録内容を適宜補充していく。特に、大規模な医療機関で利用されるレセプトコンピュータは、独自の仕様でカスタマイズされていることがある。そのため、レセプトによっては、複数の医薬品名が連続して記入されたり、また傷病名と医薬品名とが連続して記入されていることもある。オペレータはこのような文字列を見つけると、対応する医療関係用語ごとに区分けして、分解テーブルの登録内容を増やしていく。これにより、次回実行するテキストデータの分解処理の信頼性を、前回よりも確実に高めることができ、処理時間を短縮することができる。 In S22, the operator disassembles the character string that cannot be classified into a plurality of medical terms, and replenishes the content of the disassembly table as appropriate. In particular, a receipt computer used in a large-scale medical institution may be customized with unique specifications. For this reason, depending on the receipt, a plurality of drug names may be entered in succession, or a wound name and a drug name may be entered in succession. When the operator finds such a character string, the operator categorizes each corresponding medical term and increases the registration contents of the disassembly table. As a result, the reliability of the text data decomposition process to be executed next time can be reliably increased as compared to the previous time, and the processing time can be shortened.

また、オペレータは、Ｓ２３において結合できなかった医療関係用語を結合テーブルに適宜登録していく。医療関係用語の文字数は様々であるが、特に長い文字列となる医療関係用語については、間に空白が挿入されたり、摘要欄において改行されて記入されることが多い。基本的に、テキストデータはＳ２２において空白部分や改行部分で分解されるが、この分解処理では、本来１つの医療関係用語であるにもかかわらず、それが不必要に分解されて１つの医療関係用語として特定できない結果を招くこともある。そのような場合の対応として、オペレータは、分解処理される１つの医療関係用語を結合テーブルに登録しておき、Ｓ２３における結合処理の精度を高めていく。結合テーブルを適宜更新していくことで、テキストデータの抽出処理の信頼性を前回よりも高めることができ、処理時間を短縮することができる。 Further, the operator appropriately registers medical related terms that could not be combined in S23 in the combination table. The number of characters in medical terms varies, but especially for medical terms that are long strings, a space is often inserted between them or a line break is entered in the summary column. Basically, the text data is decomposed at the blank portion and the line feed portion at S22, but in this decomposition processing, although it is originally one medical related term, it is unnecessarily decomposed to one medical related term. It may lead to results that cannot be specified as terms. As a countermeasure for such a case, the operator registers one medical-related term to be decomposed in the combination table, and increases the accuracy of the combination process in S23. By appropriately updating the join table, the reliability of the text data extraction process can be improved compared to the previous time, and the processing time can be shortened.

（１０）辞書テーブルの読出
辞書テーブルの読出処理（Ｓ２６）では、格納部に記憶されている辞書テーブルを読み出す。医療関係用語には同じ意味を表す表現が複数存在することがあり、例えば、傷病名の「虫垂炎」、「盲腸」、「アッペ」は全て同じ傷病を意味する。用語の不統一は、後の統計処理などを実行する際の阻害要因となるため、レセプトファイル生成システムでは、レセプトデータの有効利用を図るべく、複数の同義語のうちの一つを「標準語」として設定し、標準語以外の同義語を「方言」と設定して取り扱うこととしている。標準語は、その識別コードおよび属性情報に対応付けられて、マスタテーブルに記憶されている。以下に、マスタテーブルと辞書テーブルとの関係を示す。 (10) Reading Dictionary Table In the dictionary table reading process (S26), the dictionary table stored in the storage unit is read. There may be a plurality of expressions representing the same meaning in medical-related terms. For example, the names of appendices “appendicitis”, “cecum”, and “appe” all mean the same injury. Terminology inconsistency becomes a hindrance to subsequent statistical processing, etc., so in the Receipt File Generation System, one of multiple synonyms is designated as “Standard Word” in order to effectively use the receipt data. ”And synonyms other than the standard language are set as“ dialect ”and handled. The standard word is stored in the master table in association with the identification code and attribute information. The relationship between the master table and the dictionary table is shown below.

マスタテーブルは、システムで標準語として採用する傷病名、医薬品名などの医療関係用語と、その医療関係用語の識別コード、およびその属性情報とを対応付けて生成される。例えば傷病名に関していえば、「疾病、傷害及び死因分類」に分類されている傷病名を標準語として設定してもよい。識別コードは、マスタテーブルと辞書テーブルとを紐付けするために用いられ、レセプトファイル生成システムにおいて独自に設定したものを用いてもよい。また、医療業界に各種存在するコード体系における識別コードを、マスタテーブルと辞書テーブルの紐付け用の識別コードとして転用してもよい。 The master table is generated by associating medical-related terms such as injury and illness names and drug names adopted as standard words in the system with identification codes of the medical-related terms and attribute information thereof. For example, as for the names of wounds and sicknesses, the names of wounds and sicknesses classified as “classification of disease, injury and death” may be set as standard words. The identification code is used for associating the master table with the dictionary table, and an identification code uniquely set in the receipt file generation system may be used. Also, identification codes in various code systems existing in the medical industry may be diverted as identification codes for linking the master table and the dictionary table.

具体的に、傷病名「虫垂炎」を標準語と設定する場合、マスタテーブルは、「虫垂炎」を、その識別コードおよびその属性情報と対応付けて記憶する。属性情報は、虫垂炎のＩＣＤ分類などの情報を含む。医薬品名や他の区分のマスタテーブルについても同様に、標準語、識別コードおよび属性情報とが対応付けられる。医薬品名の属性情報は、薬価（保険点数）を含む。マスタテーブルは、傷病名を標準化した傷病マスタテーブル、医薬品を標準化した医薬品マスタテーブルなど、複数の区分に対して作成されている。 Specifically, when the wound name “appendicitis” is set as a standard word, the master table stores “appendicitis” in association with its identification code and its attribute information. The attribute information includes information such as ICD classification of appendicitis. Similarly, the standard word, the identification code, and the attribute information are associated with the master table of the drug name and other categories. The drug name attribute information includes a drug price (insurance score). The master table is created for a plurality of sections, such as a sickness and disease master table that standardizes wound names and a pharmaceutical product master table that standardizes medicines.

辞書テーブルは、医療関係用語として標準語と同じ意味を表す同義語を、標準語の識別コードに対応付けることで生成される。同義語は、標準語以外の同義語（方言）を含み、また標準語自身を含んでもよい。具体的には、「虫垂炎」の識別コードに対して、「虫垂炎」、「盲腸」、「アッペ」を対応付けて記憶するのが辞書テーブルである。ここで、虫垂炎は標準語であり、盲腸、アッペは方言である。辞書テーブルは、マスタテーブルに対応して、傷病名を辞書化した傷病辞書テーブル、医薬品を辞書化した医薬品辞書テーブルなど、複数の区分に対して作成されている。 The dictionary table is generated by associating a synonym representing the same meaning as a standard word as a medical-related term with an identification code of the standard word. The synonym includes a synonym (dialect) other than the standard word, and may include the standard word itself. Specifically, the dictionary table stores “appendicitis”, “cecum”, and “upe” in association with the identification code of “appendicitis”. Here, appendicitis is a standard language, and the cecum and appe are dialects. The dictionary table is created for a plurality of sections such as a disease / disease dictionary table in which the names of wounds and diseases are converted into a dictionary and a drug dictionary table in which drugs are converted into a dictionary corresponding to the master table.

（１１）マッチング処理
マッチング処理（Ｓ２８）では、項目ごとに分解されたテキストデータと、読み出した辞書テーブルのデータとのマッチングをとる。テキストデータが辞書テーブルに登録されたデータと一致する場合、そのデータに対応付けられている識別コードを読み出し、続くレセプトファイル生成処理に引き渡す。 (11) Matching process In the matching process (S28), matching is performed between the text data decomposed for each item and the data of the read dictionary table. When the text data matches the data registered in the dictionary table, the identification code associated with the data is read and transferred to the subsequent receipt file generation process.

なお、虫垂炎を表現する盲腸、アッペ以外の別の名前がテキストデータとして記述されている場合、この新しい名前は辞書テーブルに登録されていないため、コンピュータは、その名前を虫垂炎の方言として認識できない。辞書テーブルに対応する名前が存在しない場合、その名前を不明データとして所定の格納領域に記憶し、その旨がオペレータに出力される。 If another name other than the cecum and appe that expresses appendicitis is described as text data, the new name is not registered in the dictionary table, and the computer cannot recognize the name as a dialectitis of appendicitis. If there is no name corresponding to the dictionary table, the name is stored as unknown data in a predetermined storage area, and a message to that effect is output.

（１２）マスタテーブル、辞書テーブル更新
マスタテーブル、辞書テーブルの更新（Ｓ３０）では、まず、オペレータが、不明データとして所定の格納領域に記憶された名前を確認する。この確認作業は、不明データが発生する度に行ってもよく、また複数の不明データがまとまった段階で行ってもよい。オペレータは、不明データが虫垂炎の新しい呼び名であることを判断すると、その呼び名を虫垂炎の方言として追加し、辞書テーブルを更新する。なお、新薬がでた場合、または新しい傷病が発生した場合、オペレータは、新たな医療関係用語に識別コードを設定して、マスタテーブルおよび辞書テーブルを更新する。 (12) Master Table / Dictionary Table Update In updating the master table / dictionary table (S30), first, the operator confirms the name stored in the predetermined storage area as unknown data. This confirmation operation may be performed each time unknown data is generated, or may be performed when a plurality of unknown data are collected. When the operator determines that the unknown data is a new name for appendicitis, the operator adds the name as an appendicitis dialect and updates the dictionary table. In addition, when a new medicine comes out or when a new injury occurs, the operator sets an identification code for a new medical term and updates the master table and the dictionary table.

（１３）一時辞書テーブル生成
一時辞書テーブル生成処理（Ｓ３２）では、一時的に利用される辞書テーブルを生成する。テキストデータが誤記であったり、ある医療機関でのみ使用される特殊な表現であるような場合、オペレータは、そのテキストデータで示す文字列を、標準語の識別コードと対応付ける一時辞書テーブルを生成する。一時辞書テーブルは、例えば当月に限って利用される。 (13) Temporary Dictionary Table Generation In the temporary dictionary table generation process (S32), a temporary dictionary table is generated. When the text data is erroneously written or is a special expression used only in a certain medical institution, the operator generates a temporary dictionary table that associates the character string indicated by the text data with the standard word identification code. . The temporary dictionary table is used only for the current month, for example.

（１４）レセプトファイル生成
レセプトファイル生成処理（Ｓ３４）では、テキストデータ中の方言や特殊な表現を標準語に紐付けたレセプトファイルを生成する。具体的にレセプトファイルでは、テキストデータ中の表現に対して、マスタテーブルで使用する標準語の識別コードをリンクさせる。このレセプトファイルは、もとの紙レセプトに記入されていた内容をそのまま残し、含まれる方言や特殊な表現については、辞書テーブルを参照することで、標準語に対応付けて構成される。 (14) Receipt file generation In the receipt file generation process (S34), a receipt file in which dialects and special expressions in the text data are linked to standard words is generated. Specifically, in the receipt file, an identification code of a standard word used in the master table is linked to the expression in the text data. This receipt file is made up of the contents entered in the original paper receipt as it is, and the dialects and special expressions contained therein are configured in correspondence with standard words by referring to the dictionary table.

（１５）データ加工
データ加工処理（Ｓ３６）では、生成したレセプトファイルをもとに、統計的な処理や、予測医学など、ユーザのニーズに合わせた様々な処理を実行する。これは、レセプトファイル中の医療関係用語が標準語に紐付けされていることで可能となり、標準語の識別コードをキーとして、様々なデータ加工が可能となる。また、各レセプトファイルは、患者個人にも紐付けされているため、患者の傷病履歴の把握や、予測医療が可能となる。Ｓ１８〜Ｓ３６のステップは、レセプトファイル生成システムにより実行される。 (15) Data processing In the data processing process (S36), various processes according to the user's needs such as statistical processing and predictive medicine are executed based on the generated receipt file. This is possible because medical terms in the receipt file are linked to standard words, and various data processing is possible using the standard word identification code as a key. In addition, since each receipt file is associated with an individual patient, it is possible to grasp the patient's injury history and predict medical care. Steps S18 to S36 are executed by the receipt file generation system.

図２は、本発明の実施例におけるレセプト処理システム１を示す。レセプト処理システム１は、紙レセプトのイメージデータから文字を読み取ってテキストデータを生成するテキストデータ生成システム２と、テキストデータ生成システム２において生成されたテキストデータから方言などを標準語に紐付けしたレセプトファイルを生成するレセプトファイル生成システム３を備える。テキストデータ生成システム２は、図１における紙レセプトのイメージ化処理（Ｓ１２）からパンチ処理（Ｓ１６）までの処理ステップを実行する。また、レセプトファイル生成システム３は、論理チェック処理（Ｓ１８）からデータ加工処理（Ｓ３６）までの処理ステップを実行する。テキストデータ生成システム２およびレセプトファイル生成システム３は、同一の主体により管理、運営されてもよく、また別主体が提携することで共同運営されてもよい。 FIG. 2 shows a receipt processing system 1 according to an embodiment of the present invention. The receipt processing system 1 includes a text data generation system 2 that reads text from image data of a paper receipt and generates text data, and a receipt that associates dialects and the like with standard words from the text data generated in the text data generation system 2. A receipt file generation system 3 for generating a file is provided. The text data generation system 2 executes processing steps from the paper receipt imaging process (S12) to the punching process (S16) in FIG. The receipt file generation system 3 executes processing steps from the logic check process (S18) to the data processing process (S36). The text data generation system 2 and the receipt file generation system 3 may be managed and operated by the same entity, or may be jointly operated by another entity in cooperation.

本実施例におけるレセプト処理システム１の機能は、テキストデータ生成システム２およびレセプトファイル生成システム３において、ＣＰＵ、メモリ、メモリにロードされたプログラムなどによって実現される。プログラムは、テキストデータ生成システム２およびレセプトファイル生成システム３に内蔵されていてもよく、また記録媒体に格納された形態で外部から供給されるものであってもよい。したがってこれらの機能ブロックがハードウエアのみ、ソフトウエアのみ、またはそれらの組合せによっていろいろな形で実現できることは、当業者に理解されるところである。 The functions of the receipt processing system 1 in this embodiment are realized by the CPU, the memory, a program loaded in the memory, and the like in the text data generation system 2 and the receipt file generation system 3. The program may be built in the text data generation system 2 and the receipt file generation system 3, or may be supplied from the outside in a form stored in a recording medium. Accordingly, those skilled in the art will understand that these functional blocks can be realized in various forms by hardware only, software only, or a combination thereof.

図３は、テキストデータ生成システム２の構成を示す。テキストデータ生成システム２は、イメージデータ生成部１０、ＯＣＲ部１２、入力部１６、ディスプレイ１８、ドライブ装置２０および格納部２２を備える。入力部１６は、テキストデータ生成システム２の入力インタフェースであり、例えばキーボードや、マウスなどのポインティングデバイスなどにより構成される。入力部１６は、例えばディスプレイ１８に設けられるタッチパネルとして構成されてもよい。ドライブ装置２０は、ＤＶＤやＣＤなどの記憶媒体３０のデータ書込および／またはデータ読出を行う装置である。 FIG. 3 shows a configuration of the text data generation system 2. The text data generation system 2 includes an image data generation unit 10, an OCR unit 12, an input unit 16, a display 18, a drive device 20, and a storage unit 22. The input unit 16 is an input interface of the text data generation system 2 and includes, for example, a keyboard and a pointing device such as a mouse. The input unit 16 may be configured as a touch panel provided on the display 18, for example. The drive device 20 is a device that performs data writing and / or data reading from a storage medium 30 such as a DVD or a CD.

イメージデータ生成部１０はスキャナであり、紙レセプトを入力されて、紙レセプトのイメージデータ（レセプト画像）を生成する。イメージデータは、個人情報を記入された個人情報画像と、全画像から個人情報画像を除いた薬歴画像に分けられる。オペレータがディスプレイ１８を見ながら、入力部１６のポインティングデバイスを用いてレセプト画像の範囲指定をすることで、個人情報画像と薬歴画像とが分割されてもよい。 The image data generation unit 10 is a scanner, and receives paper receipt and generates image data (receipt image) of the paper receipt. The image data is divided into a personal information image in which personal information is entered and a medicine history image obtained by removing the personal information image from all images. The personal information image and the medicine history image may be divided by the operator specifying the range of the receipt image using the pointing device of the input unit 16 while looking at the display 18.

図４は、診療報酬明細書（レセプト）のイメージデータを示す。このレセプトは、医科レセプトの一例である。レセプト画像３２は、一点鎖線３３の上方にある個人情報画像３４と、下方にある薬歴画像３６とに分けられる。薬歴画像３６には、医薬品、医療材料、診療行為などが記述された摘要欄３８が存在する。 FIG. 4 shows image data of a medical fee remuneration statement (receipt). This receipt is an example of a medical receipt. The receipt image 32 is divided into a personal information image 34 above the one-dot chain line 33 and a medicine history image 36 below. The medicine history image 36 includes a summary column 38 in which medicines, medical materials, medical treatments, and the like are described.

摘要欄３８において、左端の数字は、診療区分コードを示し、アスタリスクは、同時処方であることを示す。１つのアスタリスクでブロック化される医薬品、医療材料、診療行為の一群は、同時処方された医薬品、医療材料、診療行為であることを意味し、保険点数に関して言えば、単純にそれぞれを加算した額となるのではなく、減額の対象となる。右端の数字は、回数を示し、その左にある「×」の左側の数字は、点数を示す。「点数×回数」により、そのブロックの保険点数が定まる。点数の左側にある文字列のうち、同時処方欄または診療区分コードに含まれない文字列は、医薬品名、医療材料名、診療行為名を示す。 In the summary column 38, the number at the left end indicates the medical treatment division code, and the asterisk indicates simultaneous prescription. A group of pharmaceuticals, medical materials, and medical practices that are blocked by one asterisk means that they are simultaneously prescribed pharmaceuticals, medical materials, and medical practices. In terms of insurance points, simply add each of them. Rather than become a subject of reduction. The number on the right end indicates the number of times, and the number on the left side of “x” on the left side indicates the score. The insurance score of the block is determined by “number of points × number of times”. Among the character strings on the left side of the score, the character strings that are not included in the simultaneous prescription column or the medical treatment classification code indicate the drug name, the medical material name, and the medical practice name.

図３に戻って、ＯＣＲ部１２は、まず個人情報画像３４から光学的文字認識による文字の読み取りを行う。個人情報画像３４の読み取り結果は暗号化され、薬歴画像３６に対応付けられる。 Returning to FIG. 3, the OCR unit 12 first reads characters from the personal information image 34 by optical character recognition. The reading result of the personal information image 34 is encrypted and associated with the medicine history image 36.

次に、ＯＣＲ部１２は、薬歴画像３６から光学的文字認識による文字の読み取りを行う。薬歴画像３６のうち、摘要欄３８は、使用した医薬品、診療行為などの保険点数を表記しており、支払金額に直接関係するところである。そのため、ＯＣＲ部１２は、精度よく摘要欄３８を読み取れることが必要となる。 Next, the OCR unit 12 reads characters from the medicine history image 36 by optical character recognition. In the medicine history image 36, the summary column 38 indicates insurance points such as used medicines and medical treatments, and is directly related to the payment amount. Therefore, the OCR unit 12 needs to be able to read the summary column 38 with high accuracy.

ＯＣＲ部１２において読み取られた薬歴画像３６のテキストデータが、ディスプレイ１８に表示され、オペレータ（パンチャ）は、表示されたテキストデータを修正する。ＯＣＲ部１２の文字認識率が高いほどオペレータの作業負担が軽くなり、紙レセプトのテキスト化にかかるコストを低くできる。 The text data of the medicine history image 36 read by the OCR unit 12 is displayed on the display 18, and the operator (puncher) corrects the displayed text data. The higher the character recognition rate of the OCR unit 12, the lighter the operator's workload, and the lower the cost of making paper receipt text.

図５は、摘要欄３８を文字認識して、項目ごとに分類して生成したデータファイルの表示例を示す。パンチャは、ディスプレイ１８に表示されるデータファイルと、もとの摘要欄３８の画像とを見比べながら、入力部１６を操作して、データファイルのデータ修正を行う。図５には、文字認識がうまくできた例を示しているが、文字の誤認識がある場合は、パンチャが適宜修正していく。 FIG. 5 shows a display example of a data file generated by recognizing the summary column 38 and classifying it for each item. The puncher corrects the data file by operating the input unit 16 while comparing the data file displayed on the display 18 with the image in the original summary field 38. FIG. 5 shows an example of successful character recognition. However, if there is a misrecognition of characters, the puncher corrects it appropriately.

パンチャによる修正が終了したデータファイルは、格納部２２に格納される。ドライブ装置２０は、レセプトのデータファイルを記憶媒体３０に記録する。記憶媒体３０は、レセプトファイル生成システム３に引き渡される。 The data file that has been corrected by the puncher is stored in the storage unit 22. The drive device 20 records the receipt data file in the storage medium 30. The storage medium 30 is delivered to the receipt file generation system 3.

図６は、レセプトファイル生成システム３の構成を示す。レセプトファイル生成システム３は、テキストデータ処理部５０、マッチング処理部５２、レセプトファイル生成部５４、論理チェック処理部５５、対応チェック処理部５６、データ加工部５８、入力部６０、ディスプレイ６２、参照テーブル格納部６４、データ格納部６６およびドライブ装置６８を備える。入力部６０は、レセプトファイル生成システム３の入力インタフェースであり、例えばキーボードや、マウスなどのポインティングデバイスなどにより構成される。入力部６０は、例えばディスプレイ１８に設けられるタッチパネルとして構成されてもよい。ドライブ装置６８は、ＤＶＤやＣＤなどの記憶媒体３０のデータ書込および／またはデータ読出を行う装置である。テキストデータ生成システム２およびレセプトファイル生成システム３とが一体のレセプト処理システム１として構成される場合、入力部、ディスプレイ、ドライブ装置、格納部などの構成が共用されてもよい。参照テーブル格納部６４は、マスタテーブル格納部７０、辞書テーブル格納部７２、一時辞書テーブル格納部７４、分解テーブル格納部７６、結合テーブル格納部７７、振分テーブル格納部７８を備えて構成される。 FIG. 6 shows the configuration of the receipt file generation system 3. The receipt file generation system 3 includes a text data processing unit 50, a matching processing unit 52, a receipt file generation unit 54, a logic check processing unit 55, a correspondence check processing unit 56, a data processing unit 58, an input unit 60, a display 62, and a reference table. A storage unit 64, a data storage unit 66, and a drive device 68 are provided. The input unit 60 is an input interface of the receipt file generation system 3 and includes, for example, a keyboard and a pointing device such as a mouse. The input unit 60 may be configured as a touch panel provided on the display 18, for example. The drive device 68 is a device that performs data writing and / or data reading of the storage medium 30 such as a DVD or a CD. When the text data generation system 2 and the receipt file generation system 3 are configured as an integrated reception processing system 1, the configurations of an input unit, a display, a drive device, a storage unit, and the like may be shared. The reference table storage unit 64 includes a master table storage unit 70, a dictionary table storage unit 72, a temporary dictionary table storage unit 74, a decomposition table storage unit 76, a combined table storage unit 77, and a distribution table storage unit 78. .

レセプトのデータファイルを記録した記憶媒体３０がドライブ装置６８に挿入され、データファイルがデータ格納部６６に書き込まれる。論理チェック処理部５５が、データ格納部６６からデータファイルを読み出し、テキスト化されたデータの論理チェックを行う。例えば、患者の誕生日が未来日付となっている場合、論理チェック処理部５５は、そのエラーを検出して、オペレータに通知する。オペレータは、その通知を受けて、エラー部分を修正する。 The storage medium 30 in which the receipt data file is recorded is inserted into the drive device 68, and the data file is written into the data storage unit 66. The logic check processing unit 55 reads the data file from the data storage unit 66 and performs a logic check on the text data. For example, when the patient's birthday is a future date, the logic check processing unit 55 detects the error and notifies the operator. The operator receives the notification and corrects the error part.

対応チェック処理部５６は、テキストデータ間の対応関係をチェックする。図５を参照して、医療材料名「画像記録用フィルム（１）半切１枚」と、点数（２２３８）、回数（１）とは、互いに対応付けられるべきデータであるが、図５のデータファイルでは異なる行に存在する。そのため、対応チェック処理部５６は、これらが同じ行となるように修正する。「超音波検査（断層撮影法）・胸腹部」と、その点数（５５０）、回数（１）についても同様である。 The correspondence check processing unit 56 checks the correspondence between the text data. Referring to FIG. 5, the medical material name “image recording film (1) one half cut”, the score (2238), and the number of times (1) are data that should be associated with each other. It exists on a different line in the file. Therefore, the correspondence check processing unit 56 corrects them so that they are on the same line. The same applies to “ultrasound examination (tomography) / chest and abdomen”, its score (550), and number of times (1).

このようなチェックを実行するために、対応チェック処理部５６は、データ項目の対応関係を予め保持しておく。この対応関係は、データ格納部６６に記憶されていてもよい。例えば、図５に示す摘要欄のデータでは、「同」欄、「薬名等」欄、「点数」欄、「回数」欄のデータ項目がそれぞれ対応付けられて存在する必要があり、これらの対応関係が満たされていない場合には、対応チェック処理部５６が、データエラーを検出する。検出されたエラーは、ディスプレイ６２を通じてオペレータに通知される。オペレータはエラー内容を見て、データ同士を正しい対応関係に修正する。さらに対応チェック処理部５６は、既述したように、診療開始日と２回目以降の診療日との前後関係が逆であったり、数字が入力されるべき項目に文字が入力されているような誤りもチェックする。以上の論理チェックおよび対応チェックを行うことで、データファイルに含まれるエラーを取り除くことができる。 In order to execute such a check, the correspondence check processing unit 56 holds a correspondence relationship between data items in advance. This correspondence relationship may be stored in the data storage unit 66. For example, in the data in the summary column shown in FIG. 5, the data items in the “same” column, “medicine name etc.” column, “score” column, and “number of times” column must be associated with each other. If the correspondence relationship is not satisfied, the correspondence check processing unit 56 detects a data error. The detected error is notified to the operator through the display 62. The operator looks at the error content and corrects the data to the correct correspondence. Furthermore, as described above, the correspondence check processing unit 56 has a reverse relationship between the medical treatment start date and the second and subsequent medical treatment dates, or characters are entered in items to which numbers are to be input. Check for errors. By performing the above logic check and correspondence check, errors contained in the data file can be removed.

テキストデータ処理部５０が、データファイルに含まれるテキストデータを分類して区分けし、最終的に生成するレセプトファイルのデータ項目まで分解したテキストデータを抽出する。例えば、医科レセプトの摘要欄データを、医薬品、医療材料、診療行為に分類し、さらに医薬品、医療材料、診療行為の複数項目が１行のテキストデータとして存在している場合に、それらを所期の項目ごとに分解する。また、医薬品として記されている内容が、医薬品名と量とを示す場合も、これらを分解する処理を行う。例えば、図５に示す「ルプラック錠４ｍｇ１錠」は、ルプラック錠４ｍｇ（医薬品名）と１錠（量）とに分解される。まず、テキストデータ処理部５０は、文字列に含まれるカンマや空白（ブランク）を検出して、文字列を医療関係用語ごとに区分けする。このとき、診療区分コードをもとに文字列を区分けしてもよい。 The text data processing unit 50 classifies and classifies the text data included in the data file, and extracts the text data that has been decomposed up to the data items of the receipt file to be finally generated. For example, if the medical receipt summary column data is classified into pharmaceuticals, medical materials, and medical practices, and multiple items of pharmaceuticals, medical materials, and medical practices exist as a single line of text data, they are Disassemble each item. In addition, when the contents written as a medicine indicate the name and amount of the medicine, a process for decomposing them is performed. For example, “Luplac tablets 4 mg, 1 tablet” shown in FIG. 5 is broken down into Luplac tablets 4 mg (pharmaceutical name) and 1 tablet (amount). First, the text data processing unit 50 detects a comma or a blank (blank) included in the character string, and classifies the character string for each medical term. At this time, the character string may be divided based on the medical treatment division code.

本実施例において、テキストデータ処理部５０は、分解テーブルを参照することで、文字列のテキストデータを分解し、文字列から医療関係用語のテキストデータを抽出してもよい。分解テーブルは、分解テーブル格納部７６に格納されており、連続した文字列を、その文字列を区切って分解した複数の医療関係用語に対応付けている。この対応付けは、図１のＳ２４に関して説明したようにオペレータにより予め作成されており、複数の医療関係用語が繋がって表記された新たな文字列が登場した場合には、オペレータが、その文字列を医療関係用語に分解して、分解テーブルを逐次作成していく。これにより、以後のタイミングで同じ文字列が登場した場合、分解テーブルを参照することで、テキストデータ処理部５０が、文字列を適切な医療関係用語に分解できる。分解されたデータは所定のファイルに記録される。 In this embodiment, the text data processing unit 50 may decompose the text data of the character string by referring to the decomposition table, and extract the text data of the medical terms from the character string. The disassembly table is stored in the disassembly table storage unit 76, and a continuous character string is associated with a plurality of medical terms that are decomposed by dividing the character string. This association is created in advance by the operator as described with reference to S24 in FIG. 1, and when a new character string in which a plurality of medical terms are connected appears, the operator selects the character string. Is decomposed into medical terms and a disassembly table is created sequentially. As a result, when the same character string appears at a later timing, the text data processing unit 50 can decompose the character string into appropriate medical terms by referring to the decomposition table. The decomposed data is recorded in a predetermined file.

なお、このファイルには、アスタリスクにより特定される１ブロックの関係が損なわれることなく記録される。既述したように、アスタリスクは、減額対象となる同時処方を示すものであるため、１ブロックの関係を保つことで、保険点数との対応関係を維持できる。図５において、１つのアスタリスクに検査方法「シンチグラム（全身）（骨）」と、医療材料「画像記録用フィルム（１）半切１枚」とが対応付けられているが、この検査方法と医療材料に対して、２３３８×１の保険点数が算出されており、したがって、これらの関係を維持した状態で、分解されたデータがファイルに記録される必要がある。 This file is recorded without damaging the relationship of one block specified by the asterisk. As described above, since the asterisk indicates the simultaneous prescription to be reduced, the correspondence with the insurance score can be maintained by maintaining the relationship of one block. In FIG. 5, an inspection method “scintigram (whole body) (bone)” and a medical material “image recording film (1) one piece” are associated with one asterisk. An insurance score of 2338 × 1 is calculated for the material, and therefore, the decomposed data needs to be recorded in a file while maintaining these relationships.

テキストデータ処理部５０により抽出された文字列が登録された医療関係用語であるか否かを判断するために、マッチング処理部５２は、振分テーブル格納部７８に登録された医療関係用語を参照する。振分テーブルは、全ての種類の辞書テーブルに格納されている全ての医療関係用語と、参照する辞書テーブルの種類とを対応づけて構成されている。したがって、マッチング処理部５２は、振分テーブルを参照することで、テキストデータ処理部５０により抽出された文字列が医療関係用語であるか否かを判断することができ、さらに、医療関係用語である場合に、参照するべき辞書テーブルを特定することが可能となる。これにより、後述するマッチング処理を実行する際に、全ての種類の辞書テーブルに関して抽出した文字列の検索を行うよりも、振分テーブルをもとに辞書を特定することで、マッチング処理時間を短縮することが可能となる。マッチング処理の詳細については後述する。抽出した文字列が医療関係用語でない場合には、テキストデータ処理部５０が、その文字列に対して結合処理を行う。 In order to determine whether or not the character string extracted by the text data processing unit 50 is a registered medical related term, the matching processing unit 52 refers to the medical related term registered in the distribution table storage unit 78. To do. The distribution table is configured by associating all medical terms stored in all types of dictionary tables with the types of dictionary tables to be referred to. Therefore, the matching processing unit 52 can determine whether or not the character string extracted by the text data processing unit 50 is a medical related term by referring to the distribution table. In some cases, it is possible to specify a dictionary table to be referred to. This shortens the matching processing time by specifying the dictionary based on the distribution table rather than searching the extracted character strings for all types of dictionary tables when performing the matching processing described later. It becomes possible to do. Details of the matching process will be described later. If the extracted character string is not a medical term, the text data processing unit 50 performs a combining process on the character string.

テキストデータ処理部５０は、振分テーブルを用いて抽出したテキストデータを医療関係用語として確定した後、医療関係用語として抽出できなかったテキストデータを結合する処理を行う。レセプトは、基本的に、読みやすさのために医療関係用語ごとに空白が挿入されることが多いが、上記したように、場合によっては複数の医療関係用語が連続して記述されることもあり、また逆に、１つの長い医療関係用語などについては間に空白を挿入されることもある。このようなケースでは、テキストデータ処理部５０が、１つの医療関係用語を、複数のテキストデータとして一旦抽出することになるが、これらは結合処理されて、１つの医療関係用語として抽出されることが好ましい。 The text data processing unit 50 performs processing for combining the text data that could not be extracted as medical related terms after the text data extracted using the distribution table is determined as medical related terms. In the reception, blanks are often inserted for each medical term for readability. However, as described above, a plurality of medical terms may be described in succession. On the other hand, a space may be inserted between one long medical term and the like. In such a case, the text data processing unit 50 once extracts one medical-related term as a plurality of text data, but these are combined and extracted as one medical-related term. Is preferred.

そのために、結合テーブル格納部７７が、複数の文字列を結合して生成される医療関係用語をリスト化した結合テーブルを保持する。このリスト化は、図１のＳ２４に関して説明したように予めオペレータにより実行されており、振分テーブルに登録されておらず、且つ空白や改行などにより複数に分割された医療関係用語が新たに登場した場合には、オペレータがその医療関係用語をリストに加えて、結合テーブルを逐次作成していく。 For this purpose, the combined table storage unit 77 holds a combined table that lists medical related terms generated by combining a plurality of character strings. As described with reference to S24 in FIG. 1, this list has been executed by an operator in advance, and medical terms that are not registered in the distribution table and divided into a plurality of spaces or line breaks have appeared. In this case, the operator adds the medical related terms to the list and sequentially creates a connection table.

テキストデータ処理部５０は、振分テーブルに登録されていなかった複数のテキストデータのうち、連続して抽出したものを結合する。１つの医療関係用語が空白や改行などにより分割されて抽出される際、これらは、連続したテキストデータとして抽出されることになる。そのため、テキストデータ処理部５０は、連続して抽出したテキストデータを一旦結合してみて、結合したテキストデータが結合テーブルに登録されている医療関係用語であるか否かを調査する。このとき、結合するテキストデータの数は２つ以上であればよく、その数は問わない。連続して抽出されたテキストデータを結合して結合テーブルを調査した結果、結合テーブルに登録されている医療関係用語であることが判断されると、テキストデータ処理部５０は、その結合したテキストデータを医療関係用語として抽出する。マッチング処理部５２は、振分テーブルを参照して、その医療関係用語に対して参照するべき辞書テーブルを特定する。 The text data processing unit 50 combines continuously extracted text data from a plurality of text data not registered in the distribution table. When one medical-related term is extracted by being divided by a blank or a line feed, these are extracted as continuous text data. Therefore, the text data processing unit 50 tries to combine the extracted text data once, and investigates whether or not the combined text data is a medical related term registered in the combination table. At this time, the number of text data to be combined may be two or more, and the number is not limited. When the extracted text data is joined and the joined table is examined, if it is determined that the term is a medical related term registered in the joined table, the text data processing unit 50 displays the joined text data. Are extracted as medical terms. The matching processing unit 52 refers to the distribution table and specifies a dictionary table to be referred to for the medical related term.

なお、マッチング処理部５２が、結合したテキストデータを振分テーブルを参照することで、医療関係用語であるか否かの判断を行ってもよい。一方、上記したように結合テーブルを利用する場合には、結合テーブルに登録されているデータ量が振分テーブルよりも多くないために、振分テーブルを利用して結合したテキストデータが医療関係用語であるか否かを判断する場合と比較して、処理時間を短縮することができる。 Note that the matching processing unit 52 may determine whether the combined text data is a medical term by referring to the sorting table. On the other hand, when using a combined table as described above, since the amount of data registered in the combined table is not larger than the distribution table, the text data combined using the distribution table is a medical term. Compared with the case of determining whether or not, the processing time can be shortened.

マスタテーブル格納部７０は、レセプトファイル生成システム３において採用する傷病名や医薬品などの名前の標準語を、その識別コードおよび属性情報と対応付けたマスタテーブルを格納する。識別コードは、標準語を特定するための唯一のコードであり、レセプトファイル生成システム３における独自のコードであってよい。傷病に対しては任意の一つの識別コードを割り当ててよいが、医薬品に対しては、時間とともに薬価が変更されることがあるため、属性情報に変更があるたびに新たな識別コードを割り当ててもよい。なお、識別コードを固定とする場合は、マスタテーブルのバージョン管理を行うことで、薬価の変更に対応することも可能である。 The master table storage unit 70 stores a master table in which standard words of names such as sickness names and medicines employed in the receipt file generation system 3 are associated with their identification codes and attribute information. The identification code is the only code for specifying the standard word, and may be a unique code in the receipt file generation system 3. An arbitrary identification code may be assigned to an injury or illness, but since a drug price may change over time for a pharmaceutical product, a new identification code is assigned each time the attribute information changes. Also good. In addition, when fixing an identification code, it is also possible to cope with a change in drug price by performing version management of the master table.

属性情報は、医療関係用語の種類に応じて様々なものが存在し、医療業界で各種存在するコード体系における識別コードを含んでもよい。例えば、マスタテーブルは、傷病名「虫垂炎」を、虫垂炎を特定するＩＣＤ−１０コードや、ＩＣＤ分類に対応付けて記憶する。このＩＣＤ分類は、「消化器系の疾患」である。医薬品名についても同様であり、医薬品の場合は、属性情報として保険点数などをもつ。マスタテーブルは、傷病名を標準化した傷病マスタテーブル、医薬品名を標準化した医薬品マスタテーブルなど、複数の区分に対して作成されている。 There are various types of attribute information depending on the type of medical terms, and the attribute information may include identification codes in various code systems in the medical industry. For example, the master table stores the injury / illness name “appendicitis” in association with an ICD-10 code for identifying appendicitis or an ICD classification. This ICD classification is “digestive diseases”. The same applies to the drug name. In the case of a drug, it has insurance points as attribute information. The master table is created for a plurality of categories, such as an injury and disease master table that standardizes wound names and a pharmaceutical product master table that standardizes drug names.

辞書テーブル格納部７２は、標準語とその方言とを、標準語の同義語として標準語の識別コードに対応付けた辞書テーブルを格納する。本実施例では、「虫垂炎」という傷病の標準語に対して、「盲腸」、「アッペ」を方言としているが、「虫垂炎」、「アッペ」、「盲腸」を、マスタテーブルにおける「虫垂炎」の識別コードに対応付けて記憶したのが辞書テーブルである。辞書テーブルは、マスタテーブルに対応して、傷病名を辞書化した傷病辞書テーブル、医薬品を辞書化した医薬品辞書テーブルなど、複数の区分に対して作成されている。 The dictionary table storage unit 72 stores a dictionary table in which a standard word and its dialect are associated with the identification code of the standard word as a synonym of the standard word. In this example, “cecum” and “appe” are dialects for the standard word of the disease “appendicitis”, but “appendicitis”, “appe”, and “cecum” A dictionary table is stored in association with the identification code. The dictionary table is created for a plurality of sections such as a disease / disease dictionary table in which the names of wounds and diseases are converted into a dictionary and a drug dictionary table in which drugs are converted into a dictionary corresponding to the master table.

図７は、一つの医薬品に対して生成された辞書テーブルおよびマスタテーブルの登録内容を示す。辞書テーブルには、マスタテーブルのバージョン、医薬品名、医薬品コードとが対応付けて登録されている。図示のとおり、同一の医薬品を表現する用語が複数存在し、それぞれの医薬品名が、同一の医薬品コード（識別コード）に対応付けられている。マスタテーブルには、医薬品名、医薬品コードに対応付けて、様々な属性情報が登録されている。ここで、属性情報は、薬価（保険点数）、全体量、濃度、単位、単位量、製造メーカなどである。 FIG. 7 shows registration contents of a dictionary table and a master table generated for one medicine. In the dictionary table, the version, drug name, and drug code of the master table are registered in association with each other. As shown in the figure, there are a plurality of terms representing the same medicine, and each medicine name is associated with the same medicine code (identification code). Various attribute information is registered in the master table in association with the drug name and drug code. Here, the attribute information includes drug price (insurance score), total amount, concentration, unit, unit amount, manufacturer, and the like.

マッチング処理部５２が、テキストデータ処理部５０において抽出されたテキストデータを、振分テーブルにより特定される辞書テーブルから検索してマッチング処理を行う。具体的に、マッチング処理部５２が、辞書テーブルを検索して、当該テキストデータの医療関係用語が記憶されているか否かを判断する。なお、本実施例では、振分テーブルを参照することで、抽出したテキストデータが医療関係用語であることが判定されているため、辞書テーブルには、そのテキストデータが存在していることになる。例えば、テキストデータが「塩酸エフェドリン注射液（４％１ｍｌ）」である場合、マッチング処理部５２は、同一の医療関係用語が医薬品辞書テーブルに存在していることを判断し、医薬品コードを読み出して、レセプトファイル生成部５４に通知する。レセプトファイル生成部５４は、この通知を受けると、医薬品コードをもとに、テキストデータ「塩酸エフェドリン注射液（４％１ｍｌ）」を標準語「塩酸エフェドリン注射液４％１ｍｌ」に対応付けてレセプトファイルを生成する。例えば、レセプトファイル生成部５４は、このテキストデータと医薬品コード（１００００００１４７４１）とを対応付けたレセプトファイルを生成してもよく、また、マスタテーブルにおける塩酸エフェドリン注射液のデータアドレスと対応付けたレセプトファイルを生成してもよい。いずれにしても、レセプトファイル生成部５４は、医薬品コードをもとにテキストデータをマスタテーブルの標準語に対応付けて、レセプトファイルを生成する。 The matching processing unit 52 searches the text data extracted by the text data processing unit 50 from the dictionary table specified by the distribution table, and performs matching processing. Specifically, the matching processing unit 52 searches the dictionary table to determine whether or not medical related terms of the text data are stored. In the present embodiment, since the extracted text data is determined to be a medical term by referring to the distribution table, the text data exists in the dictionary table. . For example, when the text data is “ephedrine hydrochloride injection solution (4% 1 ml)”, the matching processing unit 52 determines that the same medical related term exists in the drug dictionary table, reads the drug code, and , Notify the receipt file generation unit 54. Upon receipt of this notification, the receipt file generation unit 54 associates the text data “ephedrine hydrochloride injection solution (4% 1 ml)” with the standard word “ephedrine hydrochloride injection solution 4% 1 ml” based on the drug code. Generate a file. For example, the receipt file generation unit 54 may generate a receipt file that associates the text data with the drug code (100000014741), or a receipt file that associates the data address of the ephedrine hydrochloride injection solution in the master table. May be generated. In any case, the receipt file generation unit 54 generates a receipt file by associating the text data with the standard word of the master table based on the medicine code.

なお、テキストデータが、辞書テーブルに含まれない「塩酸エフェドリン注射液４％１ｍｌ」を表現する他の方言である場合、マッチング処理部５２は、その名前を「塩酸エフェドリン注射液４％１ｍｌ」の方言として認識できない。辞書テーブルに対応する名前が存在しない場合、マッチング処理部５２は、そのテキストデータを、不明データとしてデータ格納部６６の所定の格納領域に記憶させる。 When the text data is another dialect expressing “ephedrine hydrochloride injection solution 4% 1 ml” not included in the dictionary table, the matching processing unit 52 uses the name “ephedrine hydrochloride injection solution 4% 1 ml”. It cannot be recognized as a dialect. When there is no name corresponding to the dictionary table, the matching processing unit 52 stores the text data as unknown data in a predetermined storage area of the data storage unit 66.

オペレータは、所定の格納領域に記憶された名前を見て、これが塩酸エフェドリン注射液４％１ｍｌの別の呼び名であることを判断すると、辞書テーブルを更新して、塩酸エフェドリン注射液４％１ｍｌの方言として追加する。このとき、図７に示すように、新たな方言と医薬品コード（１００００００１４７４１）とを対応付けて追加登録する。なお、新しい傷病が発生した場合、または新薬がでた場合は、本来であればマッチング処理の前にマスタテーブルが更新されているべきであるが、そのテーブル更新が間に合わなかった場合であれば、オペレータが、マスタテーブルおよび辞書テーブルを適宜更新していく。 When the operator looks at the name stored in the predetermined storage area and determines that this is another name for the ephedrine hydrochloride injection solution 4% 1 ml, the operator updates the dictionary table, and the ephedrine hydrochloride injection solution 4% 1 ml. Add as a dialect. At this time, as shown in FIG. 7, a new dialect and a medicine code (100000014741) are additionally registered in association with each other. If a new injury or illness occurs or if a new drug appears, the master table should be updated before the matching process, but if the table update is not in time, The operator updates the master table and the dictionary table as appropriate.

医薬品の場合には、特に数量表現について、レセプト上で比較的自由な表現がなされることが多い。テキストデータ「塩酸エフェドリン注射液４％１ｍｌ１Ａ」に対して、テキストデータ処理部５０は、「塩酸エフェドリン注射液４％１ｍｌ１Ａ」を、「塩酸エフェドリン注射液４％１ｍｌ」（医薬品名）、「１」（数量）、「Ａ」（単位）とに分解する。このとき、「Ａ」（単位）の表現が「アンプル」を示すものであることをオペレータが判断すると、「塩酸エフェドリン注射液４％１ｍｌ１Ａ」を辞書テーブルに追加する。図７の辞書テーブルは、このようにしてオペレータにより追加された塩酸エフェドリン注射液４％１ｍｌ１Ａを含んだ状態を示す。 In the case of pharmaceuticals, in particular, quantitative expressions are relatively free on the receipt in many cases. For the text data “Ephedrine hydrochloride injection solution 4% 1 ml 1A”, the text data processing unit 50 changes “Ephedrine hydrochloride injection solution 4% 1 ml 1A” to “Ephedrine hydrochloride injection solution 4% 1 ml” (pharmaceutical name), “1”. (Quantity) and “A” (Unit). At this time, if the operator determines that the expression “A” (unit) indicates “ampoule”, “ephedrine hydrochloride injection solution 4% 1 ml 1A” is added to the dictionary table. The dictionary table of FIG. 7 shows a state including 1 ml 1A of ephedrine hydrochloride injection solution 4% added by the operator in this way.

一方、「塩酸エフェドリン注射液４％１ｍｌ１Ａ」についての辞書テーブルが作成されていない場合、マッチング処理部５２が、この対応関係を自律的に判断してもよい。マッチング処理部５２は、抽出されたテキストデータとマスタテーブルまたは辞書テーブルに記憶された医療関係用語との一致性を判断し、一致しない部分が数量表現である場合に、その数量表現を、マスタテーブルまたは辞書テーブルに記憶された医療関係用語に含まれる数量表現と対応付けるべきものと判断する。 On the other hand, when the dictionary table for “ephedrine hydrochloride injection solution 4% 1 ml 1A” is not created, the matching processing unit 52 may autonomously determine this correspondence. The matching processing unit 52 determines the consistency between the extracted text data and the medical-related terms stored in the master table or the dictionary table, and when the unmatched portion is a quantity expression, the quantity expression is Or it judges that it should be matched with the quantity expression contained in the medical related term memorize | stored in the dictionary table.

具体的に、マッチング処理部５２は、「塩酸エフェドリン注射液４％１ｍｌ」を含む医療関係用語をマスタテーブルにおいて探索する。このとき、先に辞書テーブルを検索して医薬品コードを特定してから、マスタテーブルを参照しにいってもよい。マッチング処理部５２は、マスタテーブルに含まれる塩酸エフェドリン注射液４％１ｍｌのデータ内容を参照して、塩酸エフェドリン注射液４％１ｍｌの数量表現の単位が「１管」であることを認識する。マッチング処理部５２は、「塩酸エフェドリン注射液４％１ｍｌ１Ａ」の２番目の「１」については、数字であるため数量の表現であることを認識し、「Ａ」については、分類が不明な表現であるが、数字に続く文字であることから、単位表現であることを推測する。この推測に基づき、マッチング処理部５２は、「Ａ」が「管」を表現するものであることを認識し、「１Ａ」が、「１管」に対応することを判断する。以上により、マッチング処理部５２は、テキストデータの示す表現が辞書テーブルに登録されていなくとも、数量表現の方言を標準語に対応付けることができ、レセプトファイル生成部５４は、「塩酸エフェドリン注射液４％１ｍｌ１Ａ」に、医薬品コード（１００００００１４７４１）を対応付けたレセプトファイルを生成することが可能となる。 Specifically, the matching processing unit 52 searches the master table for medical related terms including “ephedrine hydrochloride injection solution 4% 1 ml”. At this time, it may be possible to refer to the master table after first searching the dictionary table to identify the drug code. The matching processing unit 52 refers to the data content of the ephedrine hydrochloride injection solution 4% 1 ml contained in the master table and recognizes that the unit of the quantity expression of the ephedrine hydrochloride injection solution 4% 1 ml is “1 tube”. The matching processing unit 52 recognizes that the second “1” of “ephedrine hydrochloride injection 4% 1 ml 1A” is a number because it is a number, and “A” has an unknown classification. However, it is a unit expression because it is a character following a number. Based on this guess, the matching processing unit 52 recognizes that “A” represents “tube”, and determines that “1A” corresponds to “1 tube”. As described above, the matching processing unit 52 can associate the dialect of the quantity expression with the standard word even if the expression indicated by the text data is not registered in the dictionary table, and the receipt file generating unit 54 can select “Ephedrine hydrochloride injection 4 It is possible to generate a receipt file in which the drug code (100000014741) is associated with “% 1ml1A”.

また、所定の格納領域に記憶された名前が特殊な表現の場合、オペレータは、所定の期間に限り利用される辞書テーブル（一時辞書テーブル）を作成する。特殊な表現とは、既述のように辞書テーブルに記憶されておらず、且つ通常使用されない文字列を意味する。例えば、塩酸エフェドリン注射液４％１ｍｌの一般的な別の呼び名としては分類することができず、ある医療機関においてのみ使用されている表現であったり、または、医療関係用語の単なる誤記である場合に、オペレータは、その文字列を特殊な表現として分類し、一時辞書テーブルを作成する。 When the name stored in the predetermined storage area is a special expression, the operator creates a dictionary table (temporary dictionary table) that is used only for a predetermined period. The special expression means a character string that is not stored in the dictionary table as described above and is not normally used. For example, it cannot be classified as another common name for 4% 1 ml of ephedrine hydrochloride injection solution, and it is an expression used only in a certain medical institution, or is simply a typographical error in medical terms The operator classifies the character string as a special expression and creates a temporary dictionary table.

一時辞書テーブルは、所定の期間、例えばその月に限って使用される辞書であり、作成されると一時辞書テーブル格納部７４に記憶される。医薬品名「塩酸エフェドリン注射液４％１ｍｌ」に対して、テキストデータが「塩酸エファドリン注射液４％１ｍｌ」との誤記がなされている場合、他に選択肢がなく、明らかに「塩酸エフェドリン注射液４％１ｍｌ」の誤記であることが判明するのであれば、その月限りの辞書として、「塩酸エファドリン注射液４％１ｍｌ」を正式な呼び名「塩酸エフェドリン注射液４％１ｍｌ」に紐付けさせる。また、ある医療機関においてのみ使用されている特殊な表現も同様であり、オペレータにより標準語に紐付けされる。このように、一時辞書テーブルは、特殊な表現をマスタテーブルに記憶されている標準語の識別コードに対応付けて作成される。レセプトファイル生成システム３では、レセプトの記入事項の誤りなどの特殊表現を修正するのではなく、その特殊表現に対して標準語の識別コードを対応付けることで、レセプトの記入事項を保存しながら、統一的な処理を可能とするデータとして取扱うこととする。 The temporary dictionary table is a dictionary that is used only for a predetermined period, for example, the month. When the temporary dictionary table is created, the temporary dictionary table is stored in the temporary dictionary table storage unit 74. If the text name of the drug name “ephedrine hydrochloride injection 4% 1 ml” is erroneously written as “ephedrine hydrochloride injection 4% 1 ml”, there is no other option, clearly “ephedrine hydrochloride injection 4 If it is determined that the error is “% 1 ml”, “the ephedrine hydrochloride injection solution 4% 1 ml” is linked to the official name “ephedrine hydrochloride injection solution 4% 1 ml” as the dictionary for the month. The same applies to special expressions that are used only in certain medical institutions, and are linked to standard words by the operator. In this way, the temporary dictionary table is created by associating a special expression with the identification code of the standard word stored in the master table. The receipt file generation system 3 does not correct a special expression such as an error in a receipt entry, but associates a standard language identification code with the special expression to save the receipt entry in a unified manner. It will be handled as data that can be processed.

医薬品の場合、基本的に名前付けが自由であり、特にカタカナの組合せであることが多いため、当月では類似した呼び名の医薬品が存在していなくても、翌月には、「塩酸エファドリン注射液４％１ｍｌ」に一致する医薬品が新薬として登場する可能性もある。そのため、誤記などの特殊な表現と認められる名前については、方言として辞書テーブルに登録するのではなく、一時限りの辞書として利用するために一時辞書テーブルに登録して、一時辞書テーブル格納部７４に格納する。レセプトファイル生成部５４は、一時辞書テーブルを参照して、特殊な表現を示すテキストデータをマスタテーブルの標準語に対応付けたレセプトファイルを生成する。 In the case of medicines, naming is basically free, and in particular, there are many combinations of katakana, so even if there are no medicines with similar names in this month, “Ephadrine Hydrochloride Injection 4” There is a possibility that a drug matching “% 1 ml” will appear as a new drug. Therefore, names that are recognized as special expressions such as typographical errors are not registered in the dictionary table as dialects, but are registered in the temporary dictionary table for use as a temporary dictionary and stored in the temporary dictionary table storage unit 74. Store. The receipt file generation unit 54 refers to the temporary dictionary table and generates a receipt file in which text data indicating a special expression is associated with a standard word of the master table.

以上のように、レセプトファイル生成部５４は、テキストデータ中の方言や特殊な表現を標準語に紐付けしたレセプトファイルを生成する。このレセプトファイルは、もとの紙レセプトに記入されていた内容をそのまま残し、含まれる方言や特殊表現については、辞書テーブルまたは一時辞書テーブルを参照することで標準語に対応付けられている。 As described above, the receipt file generation unit 54 generates a receipt file in which dialects and special expressions in text data are linked to standard words. This receipt file leaves the contents entered in the original paper receipt as they are, and the dialects and special expressions contained therein are associated with standard words by referring to the dictionary table or temporary dictionary table.

レセプトファイルには様々な利用方法がある。例えば、統計的な処理や、使用される医薬品の動向など、研究者や医薬品メーカにとって有用なデータ処理に利用することができる。さらに、暗号化した個人情報に対応付けてレセプトファイルを作成するため、個人の時系列的な診療履歴を把握することができ、適切な医療行為が施されているかのチェックも可能となる。また予測医学も可能であり、患者の健康維持のためのデータとしても活用できる。これらの処理は、データ加工部５８により実行される。 There are various ways to use the receipt file. For example, it can be used for data processing useful for researchers and pharmaceutical manufacturers, such as statistical processing and trends in pharmaceuticals used. Furthermore, since the receipt file is created in association with the encrypted personal information, it is possible to grasp the personal time-series medical history and check whether appropriate medical practice is being performed. Predictive medicine is also possible and can be used as data for maintaining the health of patients. These processes are executed by the data processing unit 58.

上記した実施例では、辞書テーブルを振分テーブルを用いて選択する例について説明したが、別の方法として、振分テーブルを利用せずに、マッチング処理部５２が、全ての種類の辞書テーブルを参照して、対応する医療関係用語を検索してもよい。 In the above-described embodiment, an example in which a dictionary table is selected using a sorting table has been described. As an alternative method, the matching processing unit 52 uses all sorts of dictionary tables without using the sorting table. The corresponding medical term may be searched with reference to the reference.

また上記した振分テーブルを使うよりも、さらに検索時間を短縮するために、振分テーブル格納部７８が、標準語およびその方言に特徴的な文字列を、参照する辞書テーブルの種類に対応付けた振分テーブルを格納してもよい。塩酸エフェドリン注射液４％１ｍｌに関して言えば、この振分テーブルは、全ての方言において共通となる文字列「エフェドリン注射液」を、参照する辞書テーブル、すなわち医薬品辞書テーブルに対応付けている。マッチング処理部５２は、振分テーブルを参照することで、参照する辞書テーブルとして医薬品辞書テーブルを特定することが可能となる。振分テーブルのデータ量を少なくすることで、振分テーブルの検索時間を短縮することができ、ひいてはマッチング処理時間を短縮することが可能となる。 Further, in order to further reduce the search time than using the above-described distribution table, the distribution table storage unit 78 associates the character strings characteristic of the standard word and its dialect with the type of dictionary table to be referred to. A sorting table may be stored. Regarding the ephedrine hydrochloride injection solution 4% 1 ml, this sorting table associates the character string “ephedrine injection solution” common to all dialects with a dictionary table to be referred to, that is, a pharmaceutical dictionary table. The matching processing unit 52 can specify the medicine dictionary table as the dictionary table to be referred to by referring to the distribution table. By reducing the amount of data in the distribution table, the search time for the distribution table can be shortened, and as a result, the matching processing time can be shortened.

図８は、標準語への対応付けを行う処理フローを示す。このフローでは、医薬品名を例とする。
まず、テキストデータを分解して、医薬品名を抽出する（Ｓ１０２）。マッチング処理部５２は、この医薬品名を辞書テーブルにおいて検索し（Ｓ１０４）、辞書テーブルに医薬品名として記憶されていることが判明した場合には（Ｓ１０４のＹ）、レセプトファイル生成部５４がそのテキストデータを医薬品コードと対応付けて、テキストデータをマスタテーブルの標準語に対応付けたレセプトファイルを生成する（Ｓ１０６）。 FIG. 8 shows a processing flow for associating with a standard word. In this flow, the drug name is taken as an example.
First, the text data is decomposed and the drug name is extracted (S102). The matching processing unit 52 searches the drug name in the dictionary table (S104), and if it is found that the drug name is stored in the dictionary table (Y in S104), the receipt file generating unit 54 reads the text. A receipt file in which the data is associated with the drug code and the text data is associated with the standard word of the master table is generated (S106).

辞書テーブルに登録されていない場合（Ｓ１０４のＮ）、マッチング処理部５２は、この医薬品名を、不明データとしてデータ格納部６６の所定の格納領域に記憶させる。オペレータは、この医薬品名を確認して、新薬であると判断した場合には（Ｓ１０８のＹ）、その医薬品名をマスタテーブルおよび辞書テーブルに追加し（Ｓ１１０）、標準語を医薬品名に対応付けたレセプトファイルを生成する。 When not registered in the dictionary table (N in S104), the matching processing unit 52 stores the medicine name as unknown data in a predetermined storage area of the data storage unit 66. If the operator confirms the drug name and determines that it is a new drug (Y in S108), the operator adds the drug name to the master table and dictionary table (S110), and associates the standard word with the drug name. Generate a receipt file.

新薬でない場合であって（Ｓ１０８のＮ）、ある標準語に対する新たな方言である場合には（Ｓ１１２のＹ）、オペレータは、この医薬品名を辞書テーブルに追加し、レセプトファイル生成部５４が、対応する標準語を医薬品名に対応付けたレセプトファイルを生成する（Ｓ１１４）。 If it is not a new drug (N in S108) and is a new dialect for a certain standard word (Y in S112), the operator adds this drug name to the dictionary table, and the receipt file generation unit 54 A receipt file in which the corresponding standard word is associated with the drug name is generated (S114).

新たな方言でもなく（Ｓ１１２のＮ）、特殊表現である場合には（Ｓ１１６のＹ）、オペレータが、この医薬品名を一時辞書テーブルに追加し、レセプトファイル生成部５４が、対応する標準語を、特殊表現された医薬品名に対応付けたレセプトファイルを生成する（Ｓ１１８）。特殊表現でない場合には（Ｓ１１６のＮ）、エラーメッセージがオペレータに通知される。 If it is not a new dialect (N in S112) and is a special expression (Y in S116), the operator adds this medicine name to the temporary dictionary table, and the receipt file generation unit 54 selects the corresponding standard word. Then, a receipt file associated with the specially expressed drug name is generated (S118). If it is not a special expression (N in S116), an error message is notified to the operator.

以上、本発明を実施例をもとに説明した。これらの実施例は例示であり、それらの各構成要素や各処理プロセスの組合せにいろいろな変形例が可能なこと、またそうした変形例も本発明の範囲にあることは当業者に理解されるところである。 In the above, this invention was demonstrated based on the Example. These embodiments are exemplifications, and it is understood by those skilled in the art that various modifications can be made to the combination of each component and each processing process, and such modifications are within the scope of the present invention. is there.

本実施例では、レセプトファイル生成システム３が、医療関係用語の標準語と方言とを対応付けたレセプトファイルを生成するものとして説明した。例えば、本技術は、レセプトだけでなく、例えば電子カルテなどのテキスト化されたカルテに対して、医療関係用語の標準語と方言とを対応付けたカルテファイルを生成するカルテファイル生成システムとしても利用することが可能である。カルテファイル生成システムにおいては、本実施例における対象をカルテに置換することで対応することができ、複数間の病院間での医療関係用語の統一に大きく貢献することが可能である。また、このシステムは、医療関係用語だけでなく、同義語が多く存在する分野において好適に適用することができ、有用に機能する。例えば、法曹業界において弁護士が作成する契約書などの書類を統一的に管理するために、同じ意味を表すために使用される法曹用語を標準語と方言とに分類し、方言を標準語に対応付けたデータファイルを生成することで、書類の検索を容易にすることも可能となる。 In the present embodiment, the description has been given on the assumption that the receipt file generation system 3 generates a reception file in which standard words of medical related terms are associated with dialects. For example, this technology can be used not only as a receipt, but also as a medical record file generation system that generates a medical record file that associates standard medical terms and dialects with texts such as electronic medical records. Is possible. In the medical record file generation system, it is possible to cope with the problem by replacing the target in this embodiment with a medical record, and it is possible to greatly contribute to the unification of medical related terms among a plurality of hospitals. Moreover, this system can be suitably applied in a field where there are many synonyms as well as medical-related terms and functions usefully. For example, in order to uniformly manage documents such as contracts created by lawyers in the legal industry, legal terms used to represent the same meaning are classified into standard words and dialects, and dialects correspond to standard words. By creating the attached data file, it is possible to easily search for documents.

実施例におけるレセプト処理フローを示す図である。It is a figure which shows the receipt processing flow in an Example. 実施例におけるレセプト処理システムを示す図である。It is a figure which shows the receipt processing system in an Example. テキストデータ生成システムの構成を示す図である。It is a figure which shows the structure of a text data generation system. 診療報酬明細書（レセプト）のイメージデータを示す図である。It is a figure which shows the image data of a medical remuneration statement (receipt). 摘要欄を文字認識して、項目ごとに分類して生成したデータファイルの表示例を示す図である。It is a figure which shows the example of a display of the data file which recognized the summary column and classified and produced | generated for every item. レセプトファイル生成システムの構成を示す図である。It is a figure which shows the structure of a receipt file production | generation system. 一つの医薬品に対して生成された辞書テーブルおよびマスタテーブルの登録内容を示す図である。It is a figure which shows the registration content of the dictionary table produced | generated with respect to one pharmaceutical, and a master table. 標準語への対応付けを行う処理フローを示す図である。It is a figure which shows the processing flow which matches with a standard word.

Explanation of symbols

１・・・レセプト処理システム、２・・・テキストデータ生成システム、３・・・レセプトファイル生成システム、１０・・・イメージデータ生成部、１２・・・ＯＣＲ部、１６・・・入力部、１８・・・ディスプレイ、２０・・・ドライブ装置、２２・・・格納部、３０・・・記憶媒体、３２・・・レセプト画像、３４・・・個人情報画像、３６・・・薬歴画像、３８・・・摘要欄、５０・・・テキストデータ処理部、５２・・・マッチング処理部、５４・・・レセプトファイル生成部、５５・・・論理チェック処理部、５６・・・対応チェック処理部、５８・・・データ加工部、６０・・・入力部、６２・・・ディスプレイ、６４・・・参照テーブル格納部、６６・・・データ格納部、６８・・・ドライブ装置、７０・・・マスタテーブル格納部、７２・・・辞書テーブル格納部、７４・・・一時辞書テーブル格納部、７６・・・分解テーブル格納部、７７・・・結合テーブル格納部、７８・・・振分テーブル格納部。
DESCRIPTION OF SYMBOLS 1 ... Receptacle processing system, 2 ... Text data generation system, 3 ... Receipt file generation system, 10 ... Image data generation part, 12 ... OCR part, 16 ... Input part, 18 ... Display, 20 ... Drive device, 22 ... Storage unit, 30 ... Storage medium, 32 ... Receive image, 34 ... Personal information image, 36 ... Medical history image, 38 ... Summary column, 50 ... Text data processing unit, 52 ... Matching processing unit, 54 ... Receipt file generation unit, 55 ... Logic check processing unit, 56 ... Corresponding check processing unit, 58 ... Data processing unit, 60 ... Input unit, 62 ... Display, 64 ... Reference table storage unit, 66 ... Data storage unit, 68 ... Drive device, 70 ... Master table Storage unit, 72 ... dictionary table storage unit, 74 ... one o'clock dictionary table storage unit, 76 ... separation table storage unit, 77 ... coupling table storage unit, 78 ... sorting table storing unit.

Claims

A master table storage unit that stores a master table that associates medical-related terms serving as standard words with their identification codes and attribute information;
A dictionary table storage unit for storing a dictionary table in which synonyms representing the same meaning as a standard word as medical related terms are associated with an identification code of the standard word;
A text data processing unit for extracting text data contained in a data file in which a receipt is converted into text;
A matching processing unit that searches the extracted text data in a dictionary table;
When the matching processing unit finds that the text data is stored as a medical term in the dictionary table, a receipt file that associates the text data with the standard word of the master table based on the identification code A receipt file generator for generating
A receipt file generation system comprising:

The said matching process part memorize | stores the text data in a predetermined | prescribed storage area as unknown data, when the said text data is not memorize | stored as a medical-related term in the dictionary table. Receipt file generation system.

A temporary dictionary table storage unit that stores a temporary dictionary table in which character strings that are not stored in the dictionary table and are not normally used are associated with identification codes of standard words;
The said receipt file production | generation part produces | generates the receipt file which matched the text data of the said character string with the standard word of a master table based on the temporary dictionary table. Receipt file generation system.

The said receipt file production | generation part produces | generates a receipt file only using the correspondence memorize | stored in the temporary dictionary table about the text data of the said character string only for a predetermined period, The receipt of Claim 3 characterized by the above-mentioned. File generation system.

5. The receipt file generation according to claim 1, wherein each of the master table storage unit and the dictionary table storage unit stores the master table and the dictionary table for each type of medical terms. system.

A distribution table storage unit that stores a distribution table in which character strings are associated with types of dictionary tables to be referenced;
6. The receipt file generation system according to claim 5, wherein the matching processing unit specifies a dictionary table to be referred to the extracted text data with reference to the distribution table.

A decomposition table storage unit for storing a character string and a decomposition table associated with a plurality of medical terms that are decomposed by dividing the character string;
7. The receipt file generation according to claim 1, wherein the text data processing unit extracts text data obtained by disassembling text data of a character string for each medical term with reference to a disassembly table. system.

A combined table storage unit that stores a combined table that lists medical-related terms generated by combining a plurality of character strings;
8. The receipt file generation system according to claim 7, wherein the text data processing unit extracts text data obtained by combining text data of a plurality of character strings with reference to a combination table.

The matching processing unit determines the matching between the extracted text data and medical related terms stored in the master table or the dictionary table, and when the unmatched portion is a quantity expression, the quantity expression is The receipt file generation system according to any one of claims 1 to 8, wherein it is determined that it should be associated with a quantity expression included in a medical-related term stored in a master table or a dictionary table.

A master table storage unit that stores a master table that associates medical-related terms serving as standard words with their identification codes and attribute information;
A dictionary table storage unit for storing a dictionary table in which synonyms representing the same meaning as a standard word as medical related terms are associated with an identification code of the standard word;
A text data processing unit for extracting text data included in the data file of the electronic medical record;
A matching processing unit that searches the extracted text data in a dictionary table;
When the matching processing unit finds that the text data is stored as a medical term in the dictionary table, based on the identification code, the medical record file that associates the text data with the standard word of the master table A file generator for generating
A medical record file generation system comprising:

A master table storage unit that stores a master table that associates a term that is a standard word with its identification code and attribute information;
A dictionary table storage unit that stores a dictionary table in which a standard word and a synonym representing the same meaning as the standard word are associated with an identification code of the standard word;
A text data processing unit for extracting text data contained in the data file;
A matching processing unit that searches the extracted text data in a dictionary table;
A file for generating a data file in which the text data is associated with the standard word of the master table based on the identification code when the matching processing unit finds that the text data is stored in the dictionary table A generator,
A file generation system comprising: