JP2019185403A

JP2019185403A - Patient matching method and device in receipt information/specific medical examination information database

Info

Publication number: JP2019185403A
Application number: JP2018075685A
Authority: JP
Inventors: 知明今村; Tomoaki Imamura; 恒之東野; Tsuneyuki Higashino; 龍也野田; Tatsuya Noda; 慎一郎久保; Shinichiro Kubo; 源太加藤; Genta Kato; 祐一西岡; Yuichi Nishioka; 大也明神; Daiya Myojin
Original assignee: Nara Medical University
Current assignee: Nara Medical University
Priority date: 2018-04-10
Filing date: 2018-04-10
Publication date: 2019-10-24

Abstract

To provide a patient matching method and a device for improving efficiency and accuracy of computer-assisted name identification in receipt information database for NDB.SOLUTION: A device collates the data in a receipt information database having a first hash value based on an insurer number and a second hash value based on a name, and performs computer-assisted name identification. The device first extracts the first hash value, the second hash value, and a medical treatment date by separately using a receipt other than a dispensing receipt, and the dispensing receipt, and concatenates, based on a match between the first hash value and the second hash value over a plurality of months, a receipt, and generates an intermediate table for identifying a same patient. The device next integrates the intermediate table of the same patient along time series medical treatment date and generates a correspondence table of the first hash value in the same patient. The device then generates, when a plurality of types of first hash values exist in the correspondence table, a computer-assisted name identification table in which the first hash value is replaced with a unique first hash value.SELECTED DRAWING: Figure 1

Description

本発明は、レセプト情報・特定健診等情報データベース（ＮＤＢ）におけるレセプトデータ及び特定健診等のデータを突合して患者の名寄せを行う技術に関するものである。 The present invention relates to a technique for identifying patients by collating the receipt data and specific medical examination data in an information database (NDB) such as a receipt information / specific medical examination.

レセプト情報・特定健診等情報データベース（ＮＤＢ）とは、診療報酬請求のために、病院等から審査支払機関及び保険者に送信される電子レセプトデータ、及び保険者の保有する特定健診等のデータを、匿名化処理を経て格納・構築したデータベースであり、国民皆保険制度を採る我が国における保険診療の悉皆データである。ＮＤＢは平成２１年４月〜平成２８年１２月診療分で、約１２８億８，４００万件（平成２９年３月末時点）のレセプトデータが蓄積されるなど、世界最大級の健康関連データベースであり、これを有効に活用することで各種の臨床研究、政策研究が強力に推進できると期待されている。 Information database (NDB) such as receipt information / specific medical examinations, such as electronic receipt data transmitted from hospitals etc. to examination paying institutions and insurers, and specific medical examinations held by insurers etc. It is a database that stores and constructs data through anonymization processing, and is a comprehensive data on insurance practice in Japan that employs the universal health insurance system. NDB is the world's largest health-related database, with approximately 12,884 million (as of the end of March 2017) received data from April 2009 to December 2016. Yes, it is expected that various clinical research and policy research can be strongly promoted by utilizing this effectively.

しかし現状は、疾患別、地域別等の部分的な活用に留まっており、成果が十分に上がっているとは言いがたい。その理由として、ＮＤＢの巨大なサイズと並び、診療報酬請求のために設定されているレセプトの構造がそのままでは研究目的での利用に適さない形式となっていることが主因として挙げられる。その中でも、後述する「名寄せ」は大きな問題点である。 However, the current situation is limited to partial utilization by disease, region, etc., and it is hard to say that the results have been sufficiently improved. The main reason for this is that, along with the huge size of the NDB, the structure of the receipt set for requesting medical fees is not suitable for use for research purposes. Among them, “name identification” described later is a big problem.

各医療機関は、患者ごとに、毎月、診療報酬を請求するので、レセプトデータは、１人の患者に対し、医療機関単位、１か月単位で送信されている。患者が複数月に渡って受診したり、同一月に複数の医療機関を受診したりすることは頻繁にあるため、同一患者の複数レセプトをつなぎ合わせる「名寄せ」作業を行わなければ、個人単位での分析を行うことはできない。そのため、ＮＤＢには、名寄せを可能とする個人紐付け用の匿名変数として、「ＩＤ１」と「ＩＤ２」が用意されている。ＩＤ１は保険者番号、被保険者証等記号・番号、生年月日、性別から個人情報保護のためにハッシュ関数を用いてハッシュ化された英数字列であり、ＩＤ２は氏名、生年月日、性別から同様にハッシュ関数を用いてハッシュ化された英数字列である。 Since each medical institution charges a medical fee every month for each patient, the receipt data is transmitted to one patient in units of medical institutions and monthly. Patients often visit for multiple months or visit multiple medical institutions in the same month, so if you do not perform “name collation” to connect multiple receipts from the same patient, Cannot be analyzed. Therefore, “ID1” and “ID2” are prepared in the NDB as anonymous variables for personal association that enable name identification. ID1 is an alphanumeric string hashed using a hash function to protect personal information from insurer number, insured person's identification code / number, date of birth, gender, ID2 is name, date of birth, Similarly, it is an alphanumeric string hashed using a hash function from gender.

ここで、ハッシュ関数とは、与えられたデータから一定長の疑似乱数（ハッシュ値）を生成するものであり、異なるデータから同じハッシュ値を生成することは極めて困難であるとされている。また生成された値（ハッシュ値）から元データを再現することはできないため、データの匿名化に有用とされている。しかしながら、保険者番号、記号・番号、生年月日、性別及び氏名といった個人情報を基にハッシュ値を生成するため、１つのハッシュ値を生成するだけでは、保険者番号や氏名といった個人情報に変化があった場合に突合が困難となる。そのため、ＮＤＢでは「ＩＤ１」と「ＩＤ２」の２つのハッシュ値が用意され、これらのハッシュ値を基に突合する構造となっている。 Here, the hash function is to generate a fixed-length pseudorandom number (hash value) from given data, and it is extremely difficult to generate the same hash value from different data. Further, since the original data cannot be reproduced from the generated value (hash value), it is useful for anonymizing data. However, since a hash value is generated based on personal information such as the insurer number, symbol / number, date of birth, gender, and name, simply generating one hash value changes the personal information such as the insurer number and name. If there is, it becomes difficult to join. Therefore, in NDB, two hash values “ID1” and “ID2” are prepared and have a structure that collates based on these hash values.

ところが、同一患者でも、就職・転職等で保険者は変化し、医療機関での表記ゆれ（例えば、“渡辺”と“渡邉”）や結婚・離婚等で氏名表記は変化するため、ＩＤ１、ＩＤ２ともに容易に変わり得ることが分かっている。下記表１は、別人物にも拘らずＩＤが同じとなる場合をまとめた表である。 However, even in the same patient, the insurer changes due to employment, job change, etc., and the name notation changes due to fluctuations in medical institutions (for example, “Watanabe” and “Watanabe”) and marriage / divorce etc., ID1, ID2 Both have been found to be easily changeable. Table 1 below is a table summarizing cases where the IDs are the same regardless of different persons.

上記表１に示すように、別人物にも拘らずＩＤが同じとなる場合としては、ＩＤ１もＩＤ２も同じである場合、ＩＤ１が異なるがＩＤ２が同じである場合及びＩＤ１が同じでＩＤ２が異なる場合が考えられる。
まず、ＩＤ１もＩＤ２も同じである場合とは、別人物同士で同一のＩＤ１、ＩＤ２が生成された場合である（Ｘ１）。ＩＤ１が異なるがＩＤ２が同じである場合とは、別人物同士で同一のＩＤ２が生成された場合（Ｙ１）や、同姓同名・同一生年月日で、同じ性別だった場合（Ｙ２）のことである。ＩＤ１が同じでＩＤ２が異なる場合とは、別人物同士で同一のＩＤ１が生成された場合（Ｚ１）や、同性・同一生年月日の複産(双子等)の場合（Ｚ２）のことである。なお、平成２６年の人口動態調査によれば、複産全体は１年間に１万件発生している。
下記表２は、表１とは異なり、同一人物にも拘らずＩＤが合わない場合をまとめた表である。 As shown in Table 1 above, when the ID is the same regardless of another person, when ID1 and ID2 are the same, when ID1 is different but ID2 is the same, and when ID1 is the same and ID2 is different There are cases.
First, the case where ID1 and ID2 are the same is a case where the same ID1 and ID2 are generated between different persons (X1). The case where ID1 is different but ID2 is the same is when the same ID2 is generated between different persons (Y1) or when the same family name and same date of birth are the same gender (Y2). is there. The case where ID1 is the same and ID2 is different is the case where the same ID1 is generated between different persons (Z1), or the case of multiple births (such as twins) of the same sex / date of birth (Z2). . In addition, according to the 2014 population dynamics survey, there were 10,000 cases of total complications per year.
Unlike Table 1, Table 2 below summarizes the cases where the IDs do not match despite the same person.

上記表２に示すように、同一人物にも拘らずＩＤが合わない場合としては、ＩＤ１は変わるがＩＤ２は不変である場合、ＩＤ１は不変であるがＩＤ２が変わる場合及びＩＤ１もＩＤ２も変わる場合が考えられる。
まず、ＩＤ１は変わるがＩＤ２は不変である場合としては、就職・転職（Ａ１−１）、離職（Ａ１−２）、定年（Ａ１−３）、就職・転職等で扶養が外れる（Ａ１−４）、扶養者の婚姻・離婚による保険変更（Ａ２−１）、養子縁組による扶養者変更に伴う保険変更（Ａ２−２）、扶養者の変更（Ａ２−３）、国保加入者における居住地変更（Ａ３）、後期高齢者制度への加入（Ａ４）又は保険証番号が流出した患者（Ａ５）といった場合が考えられる。ＩＤ１は不変であるがＩＤ２が変わる場合としては、養子による改姓（Ｂ１）、結婚・離婚による改姓（Ｂ２−１）、扶養者の結婚・離婚等による改姓（Ｂ２−２）、氏名変更（Ｂ３）、記入ミス・氏名等の記載ゆれ（Ｂ４）又は国籍取得に伴う氏名変更（Ｂ５）といった場合が考えられる。 As shown in Table 2 above, when the ID does not match despite the same person, ID1 changes but ID2 is unchanged, ID1 is unchanged but ID2 changes, and ID1 and ID2 also change Can be considered.
First, in the case where ID1 changes but ID2 remains unchanged, the employment is lost due to employment / change of job (A1-1), turnover (A1-2), retirement age (A1-3), employment / change of job (A1-4) ), Insurance change due to marriage / divorce of dependents (A2-1), insurance change due to dependent changes due to adoption (A2-2), change of dependents (A2-3), change of residence for National Health Insurance members Cases such as (A3), joining a senior citizen system (A4), or a patient (A5) whose insurance card number has been leaked are considered. If ID1 is unchanged but ID2 changes, the changed name by adoptive child (B1), the changed name by marriage / divorce (B2-1), the changed name by the dependent's marriage / divorce (B2-2), the name change (B3) ), Typographical error (in Japanese) (B4) or name change (B5) due to nationality acquisition.

次に、ＩＤ１もＩＤ２も変わる場合としては、まず、上記の“Ａ”と“Ｂ”を掛け合わせた場合が考えられる。例えば、結婚に伴う退社・改姓（Ａ１−２＊Ｂ２−１）、離婚に伴う就職・改姓（Ａ１−１＊Ｂ２−１）又は養子による改姓・保険者変更（Ａ２−２＊Ｂ１）といった場合が考えられる。また“Ａ”と“Ｂ”を掛け合わせた場合以外としては、性転換（Ｃ３）の場合が考えられる。 Next, as a case where both ID1 and ID2 change, the case where the above-mentioned “A” and “B” are multiplied can be considered. For example, when leaving a company / name changed due to marriage (A1-2 * B2-1), finding a job / name changed due to divorce (A1-1 * B2-1), or adopting a changed name / insurer (A2-2 * B1) Can be considered. Besides the case where “A” and “B” are multiplied, the case of sex change (C3) can be considered.

同一人物にも拘らずＩＤが合わない場合については、例えば、就職・転職（Ａ１−１）、離職（Ａ１−２）又は定年（Ａ１−３）の何れかによりＩＤ１が変わった人は、厚生労働省「平成２６年雇用動向調査結果の概況」によると１年間に約７１３万人存在することが分かっている。就職・転職等で扶養が外れる（Ａ１−４）人は１年間に約８００万人、平成２６年の総務省統計局人口推計によると後期高齢者制度への加入（Ａ４）によりＩＤ１が変わる人が約１２２万人存在することが分かっている。また、２０１４年度時点での法務省戸籍統計によると、１年間の婚姻数は約６５万組であるので、それに近い数の改姓が行われていることも推測できる。このように、同一人物にも拘らずＩＤが合わない場合は非常に多く存在することが分かる。 In the case where the ID does not match despite the same person, for example, a person whose ID1 has changed due to employment / change of job (A1-1), turnover (A1-2) or retirement age (A1-3) According to the Ministry of Labor's “2014 Survey on Employment Trends,” it is known that there are approximately 7.13 million people per year. About 8 million people who fall out of support due to employment, job change, etc. (A1-4) per year, and according to the Ministry of Internal Affairs and Communications Statistics Bureau population estimate, ID1 changes due to joining the elderly system (A4) It is known that there are about 1,220,000 people. Moreover, according to the Ministry of Justice family register statistics as of 2014, the number of marriages per year is about 650,000, so it can be inferred that the number of renames has been changed. Thus, it can be seen that there are a large number of cases where the ID does not match despite the same person.

そのため、就職・転職等の前と後で、１人の患者による受診を異なる２人の患者の受診と認識する等の可能性が生じ、ＮＤＢを用いた患者数の推計や患者１人あたりの推計値は大きな誤差を含むと考えられる。この問題を解決するためには、ＮＤＢに一生涯不変の個人ＩＤを付与する必要があるが、その実現に向けては議論が緒に就いたばかりである。また、不変ＩＤの導入が実現しても過去データについては現行のＩＤ１とＩＤ２を用いるほかない。
なお、ＮＤＢの患者突合性を高める動きとして、ＩＤ３を付与する動きがある。しかし、ＩＤ３は特定健診のＩＤ１において前述の表記ゆれ（全角半角の違い、頭に０が付与されているか否か）を改善し、特定健診等のデータとレセプトのデータの患者突合性を改善するものである。したがって、ＩＤ３を使用しても名寄せを行う必要性に変わりはない。 For this reason, there is a possibility of recognizing a visit by one patient as a visit of two different patients before and after employment, job change, etc., and estimating the number of patients using NDB and per patient Estimated values are considered to contain large errors. In order to solve this problem, it is necessary to give the NDB a personal ID that remains unchanged throughout the lifetime, but discussions have just begun. Even if the introduction of the invariant ID is realized, the current ID1 and ID2 must be used for past data.
In addition, there exists a motion which provides ID3 as a motion which improves patient matching of NDB. However, ID3 improves the above-mentioned notation fluctuation (difference in full-width half-width, whether or not 0 is given to the head) in ID1 of specific medical checkup, and improves patient matching of data of specific medical checkup and receipt data. It is an improvement. Therefore, even if ID3 is used, the need for name identification remains unchanged.

レセプトデータ及び健診データの名寄せについては、健診データとレセプトデータとを突合できる健康管理システムが知られている（特許文献１を参照）。
これは、健診データベースと、レセプトデータベースと、対応付け手段と、健診データ取込手段と、レセプトデータ取込手段と、第１の連結キー取得手段と、健診データ追加手段と、第２の連結キー取得手段と、レセプトデータ追加手段とを備えるシステムである。これによれば、外部から健診データを取り込んだ場合には、対応付けられた個人情報に基づいてレセプトデータベースから連結キーを取得し、その連結キーを付与して取り込んだ健診データを健診データベースに追加登録し、一方、外部からレセプトデータを取り込んだ場合は、対応付けられた個人情報に基づいて健診データベースから連結キーを取得し、その連結キーを付与して取り込んだレセプトデータをレセプトデータベースに追加登録することができる。その結果、健診データとレセプトデータとを突合することができる。
しかしながら、上記特許文献１に開示された健康管理システムは、就職・転職や氏名の表記ゆれ等により、個人紐付け用の匿名変数が容易に変わってしまうという問題を解決するものではない。 A health management system that can collate medical checkup data with receipt data is known (see Patent Document 1).
This includes a medical examination database, a receipt database, an association means, a medical examination data fetching means, a receipt data fetching means, a first concatenated key acquisition means, a medical examination data adding means, a second This is a system comprising a concatenated key obtaining unit and a receipt data adding unit. According to this, when medical examination data is taken in from the outside, a connection key is obtained from the receipt database based on the associated personal information, and the medical examination data acquired by giving the connection key is examined. If you add to the database and receive receipt data from the outside, on the other hand, obtain a concatenated key from the medical examination database based on the associated personal information, and accept the received receipt data with the concatenated key. Additional registration in the database is possible. As a result, the medical examination data and the receipt data can be matched.
However, the health management system disclosed in Patent Document 1 does not solve the problem that the anonymous variable for personal association easily changes due to job hunting, job change, name fluctuation, or the like.

特開２００９−２７１７４３号公報JP 2009-271743 A

かかる状況に鑑みて、本発明は、ＮＤＢ等のレセプト情報データベースにおける名寄せの効率及び正確性を向上させる患者突合方法及び装置を提供することを目的とする。 In view of such circumstances, an object of the present invention is to provide a patient matching method and apparatus that improve the efficiency and accuracy of name identification in a receipt information database such as NDB.

上記課題を解決すべく、本発明の患者突合方法は、保険者番号に基づく第１のハッシュ値と氏名に基づく第２のハッシュ値の少なくとも２つの検索キーを有するレセプト情報データベースの突合方法であって、下記１）〜４）のステップを備え、データベースにおけるデータを突合して患者の名寄せを行う。
１）調剤レセプト以外のレセプトを用いて、第１のハッシュ値と第２のハッシュ値及び診療年月を少なくとも抽出し、診療年月の複数月にわたる第１のハッシュ値と第２のハッシュ値の少なくとも何れかの一致に基づいてレセプトを連結し、同一患者の第１の名寄せ用中間テーブルを生成する。
２）調剤レセプトを用いて、第１のハッシュ値と第２のハッシュ値及び診療年月を少なくとも抽出し、診療年月の複数月にわたる第１のハッシュ値と第２のハッシュ値の少なくとも何れかの一致に基づいてレセプトを連結し、同一患者の第２の名寄せ用中間テーブルを生成する。
３）同一患者の第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルを診療年月の時系列に沿って統合し、同一患者における第１のハッシュ値の対応テーブルを生成する。
４）対応テーブルにおいて複数種の第１のハッシュ値が存在する場合には、複数種の第１のハッシュ値をユニークな第１のハッシュ値に置き換えた名寄せテーブルを生成する。 In order to solve the above problems, the patient matching method of the present invention is a method for matching a receipt information database having at least two search keys, a first hash value based on an insurer number and a second hash value based on a name. Then, the following steps 1) to 4) are provided, and the data in the database are collated to identify patients.
1) At least a first hash value, a second hash value, and a medical date are extracted using a receipt other than the dispensing receipt, and the first hash value and the second hash value over a plurality of months of the medical date Based on at least one of the matches, the receipts are concatenated to generate a first name identification intermediate table for the same patient.
2) Using the dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and at least one of the first hash value and the second hash value over a plurality of months of the medical date Based on the match, the receipts are connected to generate a second name identification intermediate table for the same patient.
3) The first name identification intermediate table and the second name identification intermediate table of the same patient are integrated along the time series of the medical treatment date, and the correspondence table of the first hash value in the same patient is generated.
4) If multiple types of first hash values exist in the correspondence table, a name identification table is generated by replacing the multiple types of first hash values with unique first hash values.

１）のステップにおいて、第１のハッシュ値と第２のハッシュ値及び診療年月以外に転帰区分を抽出することが好ましく、さらに転帰区分としては“死亡転帰”を抽出することが好ましい。調剤レセプトをそれ以外のレセプトと分けて用いるのは、外来では、医師が発行する処方箋に基づき院外の薬局で調剤が行われることが多く、医科入院外と調剤についてはペアでレセプトが発生することが多いため、一括処理を行うと患者の紐づけ情報が多対多となり、純正なマッチングが困難になるからである。
なお、調剤レセプトとそれ以外のレセプトを分けずに、一緒に処理することも可能である。かかる場合には、１）のステップにおいて、調剤レセプトを含めた処理を行い、２）のステップを省略する。また、その場合、３）のステップにおいては、第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルの統合は行わず、第１の名寄せ用中間テーブルのみを基に、同一患者における第１のハッシュ値の対応テーブルを生成する。 In the step 1), it is preferable to extract an outcome category other than the first hash value, the second hash value, and the medical date, and it is preferable to extract “death outcome” as the outcome category. The reason for using dispensing receipts separately from other receipts is that in outpatients, dispensing is often done at an out-of-hospital pharmacy based on a prescription issued by a doctor, and there is a pair receipt for out-of-hospital and dispensing This is because, when batch processing is performed, patient linking information becomes many-to-many and genuine matching becomes difficult.
In addition, it is also possible to process together, without dividing a dispensing receipt and other receptors. In such a case, the processing including the dispensing receipt is performed in the step 1), and the step 2) is omitted. In this case, in the step 3), the first name identification intermediate table and the second name identification intermediate table are not integrated, and the first patient in the same patient is based only on the first name identification intermediate table. A hash value correspondence table is generated.

上記１）、２）の第１及び第２の名寄せ用中間テーブル生成ステップにおいて、具体的には、下記５），６）のステップを行う。
５）第１のハッシュ値と第２のハッシュ値の両方が一致する場合、同一患者のレセプトとして連結し、最初のレセプト発生月から最終のレセプト発生月まで、又は、最初のレセプト発生月から転帰区分にて死亡転帰が示される当該月まで、を１つのレセプトグループとして連結する。
６）それぞれのレセプトグループにおいて、第１のハッシュ値又は第２のハッシュ値の何れかのみ一致する場合、最終のレセプト発生月を連結対象とするレセプトグループの連結候補を、第１のハッシュ値が一致、最も時系列的に近い、同年月を除き期間がオーバーラップしない、又は、同じ第１若しくは第２のハッシュ値の候補の最初のレセプト発生月が同一ではない、の少なくとも何れかを満足する最初のレセプト発生月を連結対象とするレセプトグループと連結する。
ここで、レセプトグループとは、２つ以上のレセプトが紐付けされたものだけではなく、他のレセプトと紐付けされていない１つのレセプトも含む意味で用いている。 In the first and second name identification intermediate table generating steps 1) and 2), specifically, the following steps 5) and 6) are performed.
5) If both the first hash value and the second hash value match, concatenate as a receipt for the same patient and result from the first receipt occurrence month to the last receipt occurrence month or from the first receipt occurrence month Up to the month in which the death outcome is shown in the category, it is linked as a single reception group.
6) If only one of the first hash value and the second hash value matches in each of the receipt groups, the first hash value indicates the connection candidate of the reception group whose connection target is the last receipt occurrence month. Satisfies at least one of coincidence, closest in time series, non-overlapping periods except the same year and month, or the first receipt occurrence month of the same first or second hash value candidate is not the same The first receipt month is linked to the receipt group to be consolidated.
Here, the term “recept group” is used to mean not only a group in which two or more receptors are linked, but also one receptor that is not linked to other receptors.

第１のハッシュ値が一致するものを優先して連結するのは、第２のハッシュ値については、同姓同名・同一生年月日・同性患者が同一ＩＤとなってしまうが、第１のハッシュ値では、同じ扶養に入っている同性の双子等を除き、別人が同一ＩＤとなる頻度が比較的少ないからである。最も時系列的に近いものを優先して連結するのは、時系列的に近いものの方がより同一人物である可能性が高いからである。
同年月を除き期間がオーバーラップしないものを連結するのは、期間がオーバーラップする場合には、同一人物でない可能性が高いからである。また、同年月を除くのは、就職や結婚などの事由が発生した月においては、同一人について複数の異なる第１又は第２のハッシュ値が存在することとなるからである。 For the second hash value, the same hash name, the same birth date, and the same sex patient will have the same ID for the second hash value, but the first hash value is preferentially linked. This is because, except for the same-sex twins and the like who are in the same support, the frequency with which the same person has the same ID is relatively low. The reason why the ones closest in time series are preferentially connected is that the ones near in time series are more likely to be the same person.
The reason why the periods that do not overlap except for the same year and month are connected is because there is a high possibility that they are not the same person when the periods overlap. Also, the reason for excluding the same year and month is that there are a plurality of different first or second hash values for the same person in the month in which a reason such as employment or marriage occurs.

また、資格喪失したレセプトについては、審査支払機関を経由し保険者でチェックを行い再審査請求となるため、３か月間程度は新旧の第１のハッシュ値が並存し得ることとなる。そこで、第２のハッシュ値の一致に基づいてレセプトを連結する際には、同年月だけではなく、最終のレセプト発生月の前月及び前々月についてオーバーラップするものであっても連結してもよい。かかる場合においても、最終のレセプト発生月以降に最初のレセプト発生月を有するレセプトグループの方が、最終のレセプト発生月の前月又は前々月に最初のレセプト発生月を有するレセプトグループよりも同一人物である可能性は高いと考えられるため、最終のレセプト発生月以降、前月、前々月の順に優先して連結を行うことが好ましい。
同じ第１若しくは第２のハッシュ値の候補の最初のレセプト発生月が同一ではないものを連結するのは、最初のレセプト発生月が同一である場合には、どちらのレセプトグループと連結すべきかの判断が困難であるからである。 In addition, the lost qualification receipt is checked by the insurer via the examination / payment institution and a re-examination request is made. Therefore, the new and old first hash values can coexist for about three months. Therefore, when connecting a receipt based on the match of the second hash value, not only the same month but also the previous month and the month before the last receipt may overlap. Even in such a case, the receipt group having the first receipt occurrence month after the last receipt occurrence month is the same person as the receipt group having the first receipt occurrence month in the previous month or the month before the last receipt occurrence month. Since it is considered that the possibility is high, it is preferable to perform connection in the order of the previous month and the month after the last month after the last receipt occurrence month.
If the first receipt generation month of the same first or second hash value candidate is not the same, if the first receipt generation month is the same, which receptor group should be connected? This is because judgment is difficult.

上記６）において、それぞれのレセプトグループにおいて、第１のハッシュ値又は第２のハッシュ値の何れかのみ一致する場合、最終のレセプト発生月を連結対象とするレセプトグループの連結候補を、第１のハッシュ値が一致、最も時系列的に近い、同年月を除き期間がオーバーラップしない、及び、同じ第１若しくは第２のハッシュ値の候補の最初のレセプト発生月が同一ではない、の全てを満足するレセプトグループと連結するのがより好ましい。 In the above 6), when only one of the first hash value and the second hash value matches in each of the reception groups, the connection candidate of the reception group whose connection target is the last reception month is the first Satisfies all the cases where the hash values match, the closest in time series, the periods do not overlap except for the same year and month, and the first receipt occurrence month of the same first or second hash value candidate is not the same It is more preferable to connect with the receiving group.

また、死亡転帰が示されるレセプトグループでは、その最終月から３か月を超えた最初のレセプト発生月を有する他のレセプトグループとは連結しないことが好ましい。死亡転帰が示されるレセプトグループであっても、患者の死亡直後では、紐付けすべきレセプトも存在するが、３か月を超えた場合には、紐付けする必要はないと考えられるからである。 Moreover, it is preferable not to connect with the other receipt group which has the first receipt generation | occurrence | production month more than three months from the last month in the receipt group in which a death outcome is shown. Even in the case of a receptor group that shows a death outcome, there is a receptor that should be linked immediately after the patient's death, but if it exceeds 3 months, it is considered unnecessary to link it. .

また、レセプト情報における医療機関所在地、診療開始年月日、傷病名、又は、患者の年齢階級の少なくとも何れかが一致するレセプトグループと連結することでもよい。例えば、同じハッシュ値であったとしても、遠く離れた土地でまったく異なる病名で受診した患者は別人物と考えるのが妥当であるし、異なるハッシュ値であったとしても、受診地や年齢階級、病名が一致する患者は同一人物である可能性が高まるからである。
なお、ここで一致とは、厳密な一致を要求するものではなく、近似した情報を含めてもよい。例えば、医療機関所在地の一致とは、同一の市区町村であることとしてもよいし、より広く同一の都道府県であることとしてもよい。また、傷病名が表現が異なるが実質的に同一の疾病である場合なども一致と判断できる。 Further, it may be connected to a reception group in which at least one of the location of the medical institution, the start date of medical treatment, the name of the sickness, and the age group of the patient in the reception information match. For example, even if the hash value is the same, it is appropriate to consider a patient who visited a distant place with a completely different disease name as a different person, and even if it has a different hash value, This is because there is an increased possibility that patients with matching disease names are the same person.
Here, the term “match” does not require exact matching but may include approximate information. For example, the coincidence of medical institution locations may be the same municipality, or may be more widely the same prefecture. In addition, it can be determined that the names match even when the names of wounds are different in expression but are substantially the same disease.

本発明の患者突合プログラムは、上記の患者突合方法における各ステップを、コンピュータに実行させるプログラムである。 The patient matching program of the present invention is a program that causes a computer to execute each step in the above-described patient matching method.

本発明の患者突合装置は、保険者番号に基づく第１のハッシュ値と氏名に基づく第２のハッシュ値の少なくとも２つの検索キーを有するレセプト情報データベースの突合を行う装置であって、下記ａ）〜ｄ）の手段を備え、データベースにおけるデータを突合して患者の名寄せを行う。
ａ）調剤レセプト以外のレセプトを入力し、第１のハッシュ値と第２のハッシュ値及び診療年月を少なくとも抽出し、診療年月の複数月にわたる第１のハッシュ値と第２のハッシュ値の少なくとも何れかの一致を判断し、該判断に基づいて同一患者のレセプトを連結し、第１の名寄せ用中間テーブルを生成する第１の名寄せ用中間テーブル生成手段。
ｂ）調剤レセプトを入力し、第１のハッシュ値と第２のハッシュ値及び診療年月を少なくとも抽出し、診療年月の複数月にわたる第１のハッシュ値と第２のハッシュ値の少なくとも何れかの一致を判断し、該判断に基づいて同一患者のレセプトを連結し、第２の名寄せ用中間テーブルを生成する第２の名寄せ用中間テーブル生成手段。
ｃ）同一患者の第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルを診療年月の時系列に沿って統合し、同一患者における第１のハッシュ値の対応テーブルを生成する対応テーブル生成手段。
ｄ）対応テーブルにおいて複数種の第１のハッシュ値が存在する場合には、複数種の第１のハッシュ値をユニークな第１のハッシュ値に置き換えた名寄せテーブルを生成する名寄せテーブル生成手段。 The patient matching device of the present invention is a device for matching a receipt information database having at least two search keys of a first hash value based on an insurer number and a second hash value based on a name. To d) and collate the data in the database to identify patients.
a) Input a receipt other than the dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and the first hash value and the second hash value over a plurality of months of the medical date First name identification intermediate table generation means for determining at least one of the matches and connecting the receipts of the same patient based on the determination to generate a first name identification intermediate table.
b) Input a dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and at least one of the first hash value and the second hash value over a plurality of months of the medical date A second name identification intermediate table generation unit that generates a second name identification intermediate table by determining a match between the same patients and connecting the receipts of the same patient based on the determination.
c) Correspondence table generation that integrates the first name identification intermediate table and the second name identification intermediate table of the same patient according to the time series of the medical treatment date and generates the correspondence table of the first hash value in the same patient. means.
d) Name collation table generating means for generating a name collation table in which plural types of first hash values are replaced with unique first hash values when there are plural types of first hash values in the correspondence table.

本発明によれば、ＮＤＢ等のレセプト情報データベースにおける名寄せの効率及び正確性を向上できるといった効果がある。また、これにより、「ある薬の処方数」だけではなく、「薬を飲んでいる患者の数」や「それらの患者が受けている医療行為の全体像」を臨床的に意味のある正確さで集計できるといった効果がある。 According to the present invention, there is an effect that the efficiency and accuracy of name identification in a receipt information database such as NDB can be improved. In addition to this, not only “the number of prescriptions for a certain drug” but also “the number of patients taking the drug” and “the overall picture of medical practices that those patients are receiving” are clinically meaningful. There is an effect that it can be totaled.

実施例１の患者突合方法のフロー図Flow chart of patient matching method of Example 1 第１の名寄せ用中間テーブル生成フロー図First name identification intermediate table generation flow chart ＩＤ１及びＩＤ２が同一の場合のグラフGraph when ID1 and ID2 are the same ＩＤ１とＩＤ２の紐付けの優先順位を示すグラフGraph showing the priority of ID1 and ID2 linking ＩＤ１を共通とするレセプトグループ同士の紐付けフロー図Flow chart for linking ID groups with a common ID group 紐付けにおける最終のレセプト発生月と最初のレセプト発生月の関係を示すグラフGraph showing the relationship between the last receipt occurrence month and the first receipt occurrence month in pegging 最初のレセプト発生月が同一である場合を示すグラフGraph showing the case where the first receipt month is the same 最初のレセプト発生月が異なる場合の紐付けの優先順位を示すグラフGraph showing the priority of pegging when the first receipt month is different 死亡転帰から３か月を超えた最初のレセプト発生月を有する場合を示すグラフGraph showing the first month of receipt exceeding 3 months from death outcome ＩＤ２を共通とするレセプトグループ同士の紐付けフロー図Flow chart for linking ID groups with common ID groups ＩＤ１が途切れた年月の前月に最初のレセプト発生月が存在する場合のグラフGraph when the first receipt occurrence month exists in the month before ID1 ＩＤ１が途切れた年月の前々月に最初のレセプト発生月が存在する場合のグラフGraph when the first receipt occurrence month exists two months before the year when ID1 was interrupted 第２の名寄せ用中間テーブル生成フロー図Second name identification intermediate table generation flowchart 名寄せ用中間テーブルの統合フロー図Integration flow diagram of intermediate table for name identification 循環参照の例を示すグラフChart showing examples of circular references 名寄せテーブル生成フロー図Name identification table generation flow chart ＩＤ０とＩＤ１の名寄せ精度の比較グラフComparison graph of name identification accuracy of ID0 and ID1 患者突合装置の説明図Explanatory drawing of patient matching device

以下、本発明の実施形態の一例を、図面を参照しながら詳細に説明していく。なお、本発明の範囲は、以下の実施例や図示例に限定されるものではなく、幾多の変更及び変形が可能である。 Hereinafter, an example of an embodiment of the present invention will be described in detail with reference to the drawings. The scope of the present invention is not limited to the following examples and illustrated examples, and many changes and modifications can be made.

本実施例では、平成２５年４月〜平成２６年３月の計１２か月分の医科入院レセプト、医科入院外レセプト、ＤＰＣレセプト、調剤レセプト全体を対象としている。ＤＰＣとは、平成１５年に導入された診断群分類に基づく入院医療費の１日あたり包括支払い制度であり、ＤＰＣ対象の入院で発生するレセプトの内、包括支払いに関連するレセプトがＤＰＣレセプトとなる。したがって、包括支払いに関連のない入院レセプトは医科入院レセプトとなる。これらのレセプトデータを使用して、複数月に渡るＩＤ等の変化を観察し、ＩＤ１、ＩＤ２のほか、診療年月、転帰区分を利用し作成した患者突合方法について説明する。
外来では、医師が発行する処方箋に基づき院外の薬局で調剤が行われることが多く、医科入院外と調剤についてはペアでレセプトが発生することが多い。そのため、一括処理を行うと患者の紐付け情報が多対多となり、純正なマッチングが困難になる。そこで、本実施例は、ＤＰＣ及び医科（入院／入院外）レセプトを用いた第１の名寄せ用中間テーブルと調剤レセプトを用いた第２の名寄せ用中間テーブルを別々に生成し、両者を突合して名寄せを完成させる仕組みとなっている。
なお、本実施例とは異なり、第１の名寄せ用中間テーブルの生成に、特定健診等のデータを用いてもよい。 In this embodiment, the entire 12 months of medical inpatient reception, non-medical inpatient receipt, DPC receptor, and dispensing receipt for April 2013 to March 2014 are targeted. DPC is a comprehensive payment system for inpatient medical expenses per day based on the diagnostic group classification introduced in 2003. Among the receipts that occur during hospitalization of DPCs, the receipt related to comprehensive payment is the DPC receipt. Become. Therefore, an inpatient receipt not related to the comprehensive payment becomes a medical inpatient receipt. By using these receipt data, changes in ID and the like over a plurality of months will be observed, and the patient matching method created using the medical treatment date and outcome category in addition to ID1 and ID2 will be described.
In outpatients, dispensing is often performed at an out-of-hospital pharmacy based on a prescription issued by a doctor. Receipts often occur outside the hospital and in pairs. Therefore, when batch processing is performed, patient association information becomes many-to-many and genuine matching becomes difficult. Therefore, in this embodiment, a first name identification intermediate table using a DPC and a medical department (in-patient / out-of-hospital) receipt and a second name identification intermediate table using a dispensing receipt are generated separately, and the two are combined. It is a mechanism to complete name identification.
Unlike the present embodiment, data such as a specific medical examination may be used to generate the first name identification intermediate table.

図１は、実施例１の患者突合方法のフロー図を示している。図１に示すように、まず、医科入院レセプト、医科入院外レセプト及びＤＰＣレセプトを用いて、第１の名寄せ用中間テーブルを生成する（ステップＳ０１）。次に、調剤レセプトを用いて、第２の名寄せ用中間テーブルを生成する（ステップＳ０２）。第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルを統合する（ステップＳ０３）。最後に、名寄せテーブルを生成する（ステップＳ０４）。 FIG. 1 shows a flowchart of the patient matching method of the first embodiment. As shown in FIG. 1, first, a first name identification intermediate table is generated using a medical inpatient receipt, a non-medical inpatient receipt, and a DPC receptor (step S01). Next, a second name identification intermediate table is generated using the dispensing receipt (step S02). The first name identification intermediate table and the second name identification intermediate table are integrated (step S03). Finally, a name identification table is generated (step S04).

（第１の名寄せ用中間テーブル生成について）
図２は、第１の名寄せ用中間テーブル生成フロー図を示している。図１では名寄せテーブル生成の全体フローを示したが、図２では、個別のレセプトに着目して以下、説明を行う。図２に示すように、第１の名寄せ用中間テーブル生成においては、医科入院レセプト、医科入院外レセプト及びＤＰＣレセプトからＩＤ１、ＩＤ２、診療年月、転帰区分を抽出する（ステップＳ１１）。なおＤＰＣレセプトとは、ＤＰＣ入院の総括対象医科入院レセプト、ＤＰＣ入院のＤＰＣレセプト、及び、ＤＰＣ入院の総括対象ＤＰＣレセプトのことを指している。また、ステップＳ１１で示した転帰区分については死亡情報を抽出している。後述するが、レセプトデータに死亡転帰が示されている場合は、その月から３か月以内のレセプトデータのみを紐付け、３か月を超えたレセプトデータについては紐付けを行わない。 (About the first name identification intermediate table generation)
FIG. 2 shows a first name identification intermediate table generation flowchart. Although FIG. 1 shows the overall flow of name identification table generation, FIG. 2 will be described below focusing on individual receipts. As shown in FIG. 2, in the first name identification intermediate table generation, ID1, ID2, medical treatment date, and outcome category are extracted from the medical inpatient receipt, the non-medical inpatient receipt, and the DPC receipt (step S11). The DPC receptor refers to a general hospitalization receipt for DPC hospitalization, a DPC reception for DPC hospitalization, and a DPC reception for generalization of DPC hospitalization. Also, death information is extracted for the outcome category shown in step S11. As will be described later, when the death result is indicated in the receipt data, only the receipt data within three months from that month is linked, and the receipt data exceeding 3 months is not linked.

抽出されたデータを基に、まず、複数月に渡ってＩＤ１及びＩＤ２が同じ患者は同一人物とする（ステップＳ１２）。図３は、ＩＤ１及びＩＤ２が同一の場合のグラフを示している。図３に示すように、レセプトグループＡは、ＩＤ１が“１３０”、ＩＤ２が“４７９”で共通するレセプトをグループ化している。レセプトグループＡでは、２０１５年４月、６月、１０〜１２月について共通のＩＤ１及びＩＤ２となっているが、診察にかからなかった月も含め、１つのグループとしている。このように紐付けたグループにおける最初のレセプト発生月及び最終のレセプト発生月について、他のレセプトグループと比較する。 Based on the extracted data, first, patients having the same ID1 and ID2 are made the same person over a plurality of months (step S12). FIG. 3 shows a graph when ID1 and ID2 are the same. As shown in FIG. 3, the reception group A groups receptions having a common ID 1 of “130” and ID 2 of “479”. In the receipt group A, ID1 and ID2 are common for April, June, and October to December 2015, but one group is included, including the month when the examination did not take place. The first receipt occurrence month and the last receipt occurrence month in the group linked in this way are compared with other receipt groups.

レセプトグループ同士の比較は、ＩＤ１とＩＤ２の内、ＩＤ１を優先して行う。図４は、ＩＤ１とＩＤ２の紐付けの優先順位を示すグラフである。図４に示すように、レセプトグループＢのＩＤ１は“１３０”であり、レセプトグループＡと共通である。また、レセプトグループＢのＩＤ２は“４７９”であり、レセプトグループＣと共通である。年月についてみると、レセプトグループＡの最初のレセプト発生月は２０１６年２月で、レセプトグループＣの最初のレセプト発生月は２０１５年１２月であり、レセプトグループＢの最終のレセプト発生月である２０１５年１０月とより近いのはレセプトグループＣであるといえる。
しかしながら、ＩＤ１とＩＤ２のどちらにも紐付け可能な場合は、ＩＤ１を優先する。これは、ＩＤ２では同姓同名・同一生年月日・同性患者が同一ＩＤとなってしまうが，ＩＤ１では、同じ扶養に入っている同性の双子等を除き、別人が同一ＩＤとなる頻度が比較的少ないからである。
したがって、ＩＤ１を共通とするレセプトグループが存在する（ステップＳ１３）場合には、ＩＤ１を共通とするレセプトグループ同士を紐付けする（ステップＳ１４）。その後、ＩＤ２を共通とするレセプトグループが存在するかの判断を行い（ステップＳ１５）、ＩＤ２を共通とするレセプトグループが存在する場合には、ＩＤ２を共通とするレセプトグループ同士を紐付けする（ステップＳ１６）。 The comparison between the reception groups is performed with priority given to ID1 out of ID1 and ID2. FIG. 4 is a graph showing the priority order of linking ID1 and ID2. As shown in FIG. 4, the ID 1 of the receipt group B is “130” and is common to the receipt group A. The ID 2 of the receipt group B is “479”, which is the same as the reception group C. As for the year and month, the first receipt of Receipt Group A is February 2016, the first receipt of Receipt Group C is December 2015, and the last Receipt of Receipt Group B is the receipt month. It can be said that the reception group C is closer to October 2015.
However, ID1 is given priority when it can be linked to both ID1 and ID2. In ID2, the same name, same date of birth, and same-sex patient will have the same ID in ID2, but in ID1, the frequency with which another person has the same ID is relatively high except for the same-sex twins who are in the same support. Because there are few.
Therefore, when there is a receive group having a common ID1 (step S13), the receive groups having a common ID1 are associated with each other (step S14). Thereafter, it is determined whether there is a reception group having a common ID2 (step S15). If there is a reception group having a common ID2, the reception groups having a common ID2 are linked (step S15). S16).

（ＩＤ１を共通とするレセプトグループ同士の紐付けについて）
図５は、ＩＤ１を共通とするレセプトグループ同士の紐付けフロー図を示している。図５に示すように、まず、最初のレセプト発生月が先のレセプトグループの最終のレセプト発生月と同年月以降である、後のレセプトグループを抽出する（ステップＳ２１）。図６は、紐付けにおける最終のレセプト発生月と最初のレセプト発生月の関係を示すグラフである。図６に示すように、レセプトグループＡのＩＤ１とレセプトグループＢのＩＤ１はいずれも“１３０”であり共通している。そして、レセプトグループＡの最終のレセプト発生月は２０１５年の１０月であり、レセプトグループＢの最初のレセプト発生月も２０１５年の１０月であるため、後のレセプトグループであるレセプトグループＢは、最初のレセプト発生月が先のレセプトグループであるレセプトグループＡの最終のレセプト発生月と同年月となり、紐付け可能となる。これとは異なり、例えば、レセプトグループＢの最初のレセプト発生月が２０１５年の９月であるような場合には、上記条件を充たさず、紐付けは行わないこととなる。 (About linking of receipt groups that share ID1)
FIG. 5 shows a flow chart for associating receipt groups having a common ID1. As shown in FIG. 5, first, a subsequent receipt group whose first receipt occurrence month is the same month or later as the last receipt occurrence month of the previous receipt group is extracted (step S21). FIG. 6 is a graph showing the relationship between the last receipt occurrence month and the first receipt occurrence month in pegging. As shown in FIG. 6, ID1 of the receipt group A and ID1 of the receipt group B are both “130” and are common. And since the last receipt occurrence month of the receipt group A is October of 2015 and the first receipt occurrence month of the receipt group B is October of 2015, the reception group B, which is the later receipt group, The first receipt occurrence month is the same month as the last receipt occurrence month of the reception group A, which is the previous receipt group, and can be linked. On the other hand, for example, when the first receipt occurrence month of the receipt group B is September 2015, the above condition is not satisfied and the linking is not performed.

次に、抽出された後のレセプトグループが複数存在しないかにつき判断がなされ（ステップＳ２２）、１つしか存在しない場合には、該レセプトグループを選択する（ステップＳ２３）。これに対して、抽出された後のレセプトグループが複数存在する場合には、後のレセプトグループ同士の最初のレセプト発生月は異なるかの判断がなされる（ステップＳ２４）。
ここで、後のレセプトグループ同士の最初のレセプト発生月は同一である場合には紐付けはなされない（ステップＳ２８）。図７は、最初のレセプト発生月が同一である場合を示すグラフである。図７に示すように、レセプトグループＡ、Ｂ及びＣのＩＤ１は、いずれも“１３０”であり共通している。しかしながら、レセプトグループＡの最終のレセプト発生月に対するレセプトグループＢ及びＣの最初のレセプト発生月はいずれも２０１６年１月であり、最初のレセプト発生月が同一であるといえる。したがって、レセプトグループＡは、Ｂ又はＣのいずれとも紐付けはなされないこととなる。 Next, a determination is made as to whether there are a plurality of extracted reception groups (step S22). If only one reception group exists, the reception group is selected (step S23). On the other hand, when there are a plurality of extracted reception groups, it is determined whether the first reception occurrence months of the subsequent reception groups are different (step S24).
Here, when the first receipt generation months of the subsequent receipt groups are the same, no association is made (step S28). FIG. 7 is a graph showing a case where the first receipt occurrence month is the same. As shown in FIG. 7, the IDs 1 of the receipt groups A, B, and C are all “130” and are common. However, the first receipt occurrence months of the receipt groups B and C with respect to the final receipt occurrence month of the receipt group A are both January 2016, and it can be said that the first receipt occurrence month is the same. Therefore, the receipt group A is not associated with either B or C.

図５のステップＳ２４において、後のレセプトグループ同士の最初のレセプト発生月が異なる場合には、先のレセプトグループの最終のレセプト発生月に最も近い最初のレセプト発生月を有する後のレセプトグループを選択する（ステップＳ２５）。図８は、最初のレセプト発生月が異なる場合の紐付けの優先順位を示すグラフである。図８に示すように、レセプトグループＡ、Ｂ及びＣのＩＤ１は、いずれも“２６０”であり共通している。しかしながら、レセプトグループＡの最終のレセプト発生月２０１５年３月に対して、レセプトグループＢの最初のレセプト発生月は２０１５年５月、レセプトグループＣの最初のレセプト発生月は２０１６年１月となっている。このような場合には、レセプトグループＡは、レセプトグループＡの最終のレセプト発生月２０１５年３月により近い最初のレセプト発生月を有するレセプトグループＢと紐付けすることとなる。 In step S24 of FIG. 5, when the first receipt generation months of the subsequent reception groups are different, the subsequent reception group having the first reception generation month closest to the final reception generation month of the previous reception group is selected. (Step S25). FIG. 8 is a graph showing the priority order of linking when the first receipt occurrence month is different. As shown in FIG. 8, the IDs 1 of the receipt groups A, B, and C are all “260” and are common. However, the first month of receipt of receipt group B is May 2015 and the first month of receipt of receipt group C is January 2016, compared to March 2015, the last month of receipt of receipt group A. ing. In such a case, the receipt group A is linked to the receipt group B having the first receipt occurrence month closer to the last receipt occurrence month of the receipt group A in March 2015.

図５のステップＳ２３及びステップＳ２５において、後のレセプトグループが選択された場合でも、先のレセプトグループの転帰区分に死亡転帰が示されている場合には、先のレセプトグループの最終のレセプト発生月から３か月を超えた最初のレセプト発生月を有する後のレセプトグループ（ステップＳ２６）については、紐付けがなされない（ステップＳ２８）。
これに対して、先のレセプトグループの転帰区分に死亡転帰が示されていない場合、又は、死亡転帰が示されているが、選択された後のレセプトグループが、先のレセプトグループの最終のレセプト発生月から３か月以内の最初のレセプト発生月を有する場合には、紐付けがなされる（ステップＳ２７）。
図９は、死亡転帰から３か月を超えた最初のレセプト発生月を有する場合を示すグラフである。図９に示すように、レセプトグループＡ及びＢのＩＤ１はいずれも“８６３”で共通している。しかしながら、レセプトグループＡには、図示しないが最終のレセプト発生月である２０１５年１０月のレセプトに死亡転帰が示されている。そして、レセプトグループＢの最初のレセプト発生月は２０１６年２月であり、死亡転帰が示されたレセプトグループであるレセプトグループＡの最終のレセプト発生月から３か月を超えている。したがって、レセプトグループＡとＢは紐付けされないこととなる。 Even if the later receptor group is selected in step S23 and step S25 of FIG. 5, if the death outcome is indicated in the outcome category of the earlier receptor group, the last month of receipt of the earlier receptor group The subsequent receipt group (step S26) having the first receipt generation month exceeding 3 months is not linked (step S28).
On the other hand, if the death category is not indicated in the outcome category of the previous receptor group, or if the death outcome is indicated, but the selected receptor group is the final receptor of the previous receptor group When it has the first receipt occurrence month within 3 months from the occurrence month, it is linked (step S27).
FIG. 9 is a graph showing the case of having the first receipt month exceeding 3 months from the death outcome. As shown in FIG. 9, the IDs 1 of the receipt groups A and B are both “863”. However, in the receipt group A, although not shown, the death outcome is shown in the receipt in October 2015, which is the last receipt month. And the first receipt occurrence month of the receipt group B is February 2016, and has exceeded three months from the last receipt occurrence month of the receipt group A which is the receipt group in which the death outcome was shown. Therefore, the receipt groups A and B are not linked.

（ＩＤ２を共通とするレセプトグループ同士の紐付けについて）
図２に示すように、ＩＤ１の紐付け（ステップＳ１４）の後、ＩＤ１についての途切れの前後で同一のＩＤ２を有するレセプトグループが存在する場合（ステップＳ１５）には、紐付けを行う（ステップＳ１６）。
ＩＤ２を共通とするレセプト同士の紐付けは、資格喪失したレセプトが存在することを考慮したアルゴリズムで処理を行う。なぜなら資格喪失したレセプトについては審査支払機関を経由し保険者でチェックを行い再審査請求となるため、３か月間程度は新旧のＩＤ１が併存しうるからである。具体的な処理について、図１０を参照しながら説明する。 (About linking of receipt groups that share ID2)
As shown in FIG. 2, after ID1 is linked (step S14), if there is a receipt group having the same ID2 before and after the break for ID1 (step S15), the link is performed (step S16). ).
The association between the receipts having the same ID2 is performed by an algorithm that takes into account the existence of a loss of qualification. The reason is that the old and new ID1 can coexist for about three months because the insured person checks the receipt of the lost qualification through the examination and payment institution and makes a re-examination request. Specific processing will be described with reference to FIG.

図１０は、ＩＤ２を共通とするレセプトグループ同士の紐付けフロー図を示している。図１０に示すように、まず、ＩＤ１が途切れた年月以降の、共通のＩＤ２を有するレセプトグループの内、最もＩＤ１が途切れた年月に近いものを探索する（ステップＳ３０１）。
ステップＳ３０１における探索により、ＩＤ１が途切れた年月以降のレセプトグループが検出されなかった場合（ステップＳ３０２）は、ＩＤ１が途切れた年月の前月において、共通のＩＤ２を有するレセプトグループを探索する（ステップＳ３０３）。図１１は、ＩＤ１が途切れた年月の前月に最初のレセプト発生月が存在する場合のグラフである。図１１に示すように、レセプトグループＡとレセプトグループＢのＩＤ２はいずれも“４７９”であり共通している。そして、レセプトグループＢの最初のレセプト発生月は、レセプトグループＡの最終のレセプト発生月の前月となっている。したがって、図１１に示すような場合には、レセプトグループＡとレセプトグループＢを紐付けし得ることとなる。 FIG. 10 shows a flow chart for associating receipt groups having a common ID2. As shown in FIG. 10, first, among the receipt groups having a common ID2 after the date when ID1 is interrupted, the one closest to the date when ID1 is interrupted is searched (step S301).
When the reception group after the year when ID1 is interrupted is not detected by the search in step S301 (step S302), the group having the common ID2 is searched for in the month before the year when ID1 is interrupted (step S302). S303). FIG. 11 is a graph in the case where the first receipt occurrence month exists in the month before the year when ID1 was interrupted. As shown in FIG. 11, the ID 2 of the reception group A and the reception group B is “479” and is common. And the first receipt occurrence month of the receipt group B is the month before the last receipt occurrence month of the receipt group A. Therefore, in the case shown in FIG. 11, the receipt group A and the receipt group B can be linked.

ステップＳ３０３における探索により、ＩＤ１が途切れた年月の前月において、共通のＩＤ２を有するレセプトグループが検出されなかった場合（ステップＳ３０４）は、ＩＤ１が途切れた年月の前々月において、共通のＩＤ２を有するレセプトグループを探索する（ステップＳ３０５）。図１２は、ＩＤ１が途切れた年月の前々月に最初のレセプト発生月が存在する場合のグラフである。図１２に示すように、レセプトグループＡとレセプトグループＢのＩＤ２はいずれも“４７９”であり共通している。そして、レセプトグループＢの最初のレセプト発生月は、レセプトグループＡの最終のレセプト発生月の前々月となっている。したがって、図１２に示すような場合には、レセプトグループＡとレセプトグループＢを紐付けし得ることとなる。 If the receipt group having the common ID2 is not detected in the month before the year when ID1 is interrupted by the search in step S303 (step S304), the ID2 is common in the month two months before the year when ID1 is interrupted. A receipt group is searched (step S305). FIG. 12 is a graph in the case where the first receipt occurrence month exists in the month two months before the year and month when ID1 was interrupted. As shown in FIG. 12, the ID 2 of the receipt group A and the receipt group B are both “479” and are common. Then, the first receipt occurrence month of the receipt group B is two months before the final receipt occurrence month of the receipt group A. Therefore, in the case as shown in FIG. 12, the receipt group A and the receipt group B can be linked.

上記ステップＳ３０２においてＩＤ１が途切れた年月に近いレセプトグループが検出された場合であっても、先のレセプトグループの死亡転帰から３か月を超えた最初のレセプト発生月を有する後のレセプトグループである場合（ステップＳ３０７）には、紐付けはなされない（ステップＳ３１０）。
これに対して、先のレセプトグループの死亡転帰から３か月を超えた最初のレセプト発生月を有する後のレセプトグループではない場合、又は、ステップＳ３０４若しくはステップＳ３０６のいずれかにおいて共通のＩＤ２を有するレセプトグループが検出された場合には、最初のレセプト発生月が同一である複数のレセプトグループが検出されたかの判断（ステップＳ３０８）がなされる。ここで、最初のレセプト発生月が同一である複数のレセプトグループが検出された場合には紐付けはなされない（ステップＳ３１１）。これに対して、最初のレセプト発生月が同一である複数のレセプトグループが検出されなかった場合は、ＩＤ１が途切れた年月に近いレセプトグループを紐付けする（ステップＳ３０９）。 Even in the case where a reception group close to the year and month when ID1 is interrupted is detected in step S302 above, it is a subsequent reception group having the first reception generation month that exceeds three months from the death outcome of the previous reception group. If there is any (step S307), no association is made (step S310).
On the other hand, if it is not a subsequent receipt group having the first receipt occurrence month exceeding 3 months from the death outcome of the previous receipt group, or has a common ID2 in either step S304 or step S306 If a receipt group is detected, a determination is made as to whether or not a plurality of receipt groups having the same initial receipt month have been detected (step S308). Here, when a plurality of receipt groups having the same first receipt month are detected, no association is made (step S311). On the other hand, when a plurality of receipt groups having the same first receipt month are not detected, a receipt group close to the year and month when ID1 is interrupted is linked (step S309).

（第２の名寄せ用中間テーブル生成について）
図１に示す第２の名寄せ用中間テーブルの生成（ステップＳ０２）について図１３を参照しながら説明する。図１３は、第２の名寄せ用中間テーブル生成フロー図を示している。図１３に示すように、まず、調剤レセプトからＩＤ１、ＩＤ２、診療年月を抽出する（ステップＳ４１）。次に、ＩＤ１が同一のレセプトを同一人物のものとしてグループ化する（ステップＳ４２）。ＩＤ２を共通とするレセプトグループが存在する場合（ステップＳ４３）には、ＩＤ２を共通とするレセプトグループと紐付けを行う（ステップＳ４４）。
なお、ステップＳ４４における紐付けは、図１０に示す第１の名寄せ用中間テーブル生成において、ＩＤ２を共通とするレセプトグループ同士の紐付けを行う場合と同様の方法により行う。 (About the second name identification intermediate table generation)
Generation of the second name identification intermediate table shown in FIG. 1 (step S02) will be described with reference to FIG. FIG. 13 shows a second name identification intermediate table generation flowchart. As shown in FIG. 13, first, ID1, ID2, and medical date are extracted from the dispensing receipt (step S41). Next, the receipts having the same ID1 are grouped as those of the same person (step S42). When there is a reception group having a common ID2 (step S43), it is linked to a reception group having a common ID2 (step S44).
Note that the linking in step S44 is performed by the same method as in the case of linking the receipt groups having the same ID2 in the first name identification intermediate table generation shown in FIG.

（第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルの統合について）
図１に示すように、ステップＳ０１において生成した第１の名寄せ用中間テーブルとステップＳ０２において生成した第２の名寄せ用中間テーブルは、ステップＳ０３において統合される。
図１４は、名寄せ用中間テーブルの統合フロー図を示している。図１４に示すように、第１の名寄せ用中間テーブルと第２の名寄せ用中間テーブルを統合し、実際には同一人物であるＩＤ１同士の一対一対応表を生成する（ステップＳ５１）。下記表３は先出ＩＤ１と後出ＩＤ１の一対一対応表を表している。ここで、「先出ＩＤ１」とは、医科、ＤＰＣ、調剤を問わず、一番目に出現したＩＤ１のことであり、「後出ＩＤ１」は二番目以降に出てきたＩＤ１のことである。 (About the integration of the first name identification intermediate table and the second name identification intermediate table)
As shown in FIG. 1, the first name identification intermediate table generated in step S01 and the second name identification intermediate table generated in step S02 are integrated in step S03.
FIG. 14 shows an integration flowchart of the name identification intermediate table. As shown in FIG. 14, the first name identification intermediate table and the second name identification intermediate table are integrated to generate a one-to-one correspondence table of ID1s that are actually the same person (step S51). Table 3 below shows a one-to-one correspondence table for the first ID 1 and the first ID 1. Here, “first-out ID1” refers to ID1 that appears first regardless of medical department, DPC, or dispensing, and “follow-up ID1” refers to ID1 that appears second or later.

上記表３に示す先出ＩＤ１と後出ＩＤ１は、いずれも英数字は異なるがＩＤ２が共通することにより、同一人物と考えられるものである。表３では問題とはならないが、先出ＩＤ１と後出ＩＤ１が入れ替わったものが表示されている場合には、循環参照が問題となる。
図１５は循環参照の例を示すグラフである。図１５に示すように、レセプトグループＡのＩＤ１は“１３５”であり、レセプトグループＢのＩＤ１は、“５１２”であるため、ＩＤ１は異なるが、レセプトグループＡ及びレセプトグループＢのＩＤ２はいずれも“４７９”で共通している。しかしながら、レセプトグループＡ及びレセプトグループＢは、いずれも診療年月が２０１５年４月のみの１つのレセプトからなるレセプトグループであるため、一対一対応表を作成する際には、レセプトグループＡを先出ＩＤ１として、レセプトグループＢを後出ＩＤ１とすることもできるし、逆に、レセプトグループＢを先出ＩＤ１として、レセプトグループＡを後出ＩＤ１とすることも可能である。仮にどちらも入力すると、統合の際に互いが互いを参照するという循環参照が生じてしまうため、この不都合を回避する必要がある。
そこで、循環参照が発生する場合（ステップＳ５２）には、対応セットの一方を削除する（ステップＳ５３）。 The first-out ID1 and the later-out ID1 shown in Table 3 are considered to be the same person because they are different in alphanumeric characters but have the same ID2. In Table 3, this is not a problem, but when the first ID 1 and the last ID 1 are displayed, the circular reference becomes a problem.
FIG. 15 is a graph showing an example of circular reference. As shown in FIG. 15, the ID 1 of the reception group A is “135”, and the ID 1 of the reception group B is “512”. Therefore, the ID 1 is different, but the IDs 2 of the reception group A and the reception group B are both. “479” is common. However, since both the reception group A and the reception group B are reception groups consisting of only one reception with a medical treatment date of only April 2015, when creating a one-to-one correspondence table, the reception group A first. As the outgoing ID 1, the receiving group B can be the subsequent ID 1, and conversely, the receiving group B can be the preceding ID 1 and the receiving group A can be the subsequent ID 1. If both are input, a circular reference that refers to each other at the time of integration occurs, so this inconvenience needs to be avoided.
Therefore, when a circular reference occurs (step S52), one of the correspondence sets is deleted (step S53).

（名寄せテーブル生成について）
図１６は、名寄せテーブル生成フロー図を示している。図１６に示すように、上述の一対一対応表を利用し、後出ＩＤ１を先出ＩＤ１に置き換える（ステップＳ６１）。これにより、同一人物の複数のＩＤ１が一種類に統合される。置き換えられたＩＤ１が別の対応セットの後出ＩＤ１であることもあるため、その場合は、さらに先出ＩＤ１へと置き換える。このように、全ての置き換えが完了していなければ（ステップＳ６２）、再度、後出ＩＤ１を先出ＩＤ１に置き換える（ステップＳ６１）。この作業を後出ＩＤ１がなくなるまで繰り返す。全ての置き換えが完了し、最終的に残ったＩＤ１を新しい名寄せ変数“ＩＤ０”とする（ステップＳ６３）。
なお、本実施例では、先出ＩＤ１をＩＤ０に使用しているが、後出ＩＤ１をＩＤ０に使用することでも構わない。 (About name identification table generation)
FIG. 16 shows a name identification table generation flowchart. As shown in FIG. 16, the above-described one-to-one correspondence table is used to replace the later ID 1 with the first ID 1 (step S61). Thereby, a plurality of IDs 1 of the same person are integrated into one type. Since the replaced ID1 may be a subsequent ID1 in another corresponding set, in this case, the ID1 is further replaced with the previous ID1. As described above, if all the replacements are not completed (step S62), the later ID1 is replaced with the first ID1 again (step S61). This operation is repeated until the later ID1 is exhausted. All replacement is completed, and finally the remaining ID1 is set as a new name identification variable “ID0” (step S63).
In the present embodiment, the previous ID 1 is used for ID0, but the later ID1 may be used for ID0.

具体的に、上記表３で説明する。まず、時系列的に新しいレセプトグループｃから後出ＩＤ１を先出ＩＤ１に置き換える。先出ＩＤ１が“78wmdjfg”、後出ＩＤ１が“Ajdke783”であるレセプトグループｃにおいては、後出ＩＤ１“Ajdke783”は先出ＩＤ１“78wmdjfg”に置き換えられる。また、レセプトグループｃの置き換え前の後出ＩＤ１“Ajdke783”は、レセプトグループｂの先出ＩＤ１“Ajdke783”と同一である。“Ajdke783”は“78wmdjfg”に置き換えられたため、レセプトグループｂの後出ＩＤ１“ue8k22ue”を先出ＩＤ１“Ajdke783”に置き換える際には、“78wmdjfg”に置き換えられる。同様に、レセプトグループｂの置き換え前の後出ＩＤ１“ue8k22ue”は、レセプトグループａの先出ＩＤ１“ue8k22ue”と同一であるため、レセプトグループａの後出ＩＤ１“p8d89jss”を先出ＩＤ１“ue8k22ue”に置き換える際には、“78wmdjfg”に置き換えられる。後出ＩＤ１がなくなるまで繰り返し、全ての置き換えが完了し、最終的に残ったＩＤ１は“78wmdjfg”になる。下記表４は、生成が完了した名寄せテーブルを表している。 This will be specifically described in Table 3 above. First, the succeeding ID 1 is replaced with the preceding ID 1 from the new receipt group c in time series. In the receipt group c in which the first ID 1 is “78 wmdjfg” and the last ID 1 is “Ajdke783”, the last ID 1 “Ajdke783” is replaced with the first ID “78 wmdjfg”. The subsequent ID1 “Ajdke783” before the replacement of the receipt group c is the same as the previous ID1 “Ajdke783” of the receipt group b. Since “Ajdke783” is replaced with “78wmdjfg”, when the succeeding ID 1 “ue8k22ue” of the receipt group b is replaced with the preceding ID 1 “Ajdke783”, it is replaced with “78wmdjfg”. Similarly, the subsequent ID 1 “ue8k22ue” before the replacement of the receipt group b is the same as the previous ID 1 “ue8k22ue” of the receipt group a. When replacing with "78", it is replaced with "78wmdjfg". The process is repeated until there is no later ID1, and all the replacement is completed, and finally the remaining ID1 becomes “78wmdjfg”. Table 4 below shows a name identification table that has been generated.

上記表４に示すように、レセプトグループ（ａ〜ｃ）の全てについて、ＩＤ０“78wmdjfg”が設定されている。本実施例では３回目の更新により名寄せテーブルの生成が完了しているが、実際にはより多数の置き換えが行われる場合も存在する。また、このＩＤ０には、単回受診等で名寄せ対象とならなかったため元々のＩＤ１がそのまま残存したものも含まれる。 As shown in Table 4 above, ID 0 “78wmdjfg” is set for all of the receipt groups (a to c). In this embodiment, the generation of the name identification table is completed by the third update, but there are cases where a larger number of replacements are actually performed. In addition, the ID0 includes those in which the original ID1 remains as it is because it has not been subjected to name identification in a single visit or the like.

なお、レセプトグループにおける先出ＩＤ１と後出ＩＤ１については、古い方から新しい方に向かう時系列に沿った先後関係（古い・・・先；新しい・・・後）であっても、新しい方から古い方に向かう時系列に沿った先後関係（新しい・・・先；古い・・・後）であってもよい。
また、複数のレセプトグループの名寄せの順番については、特に限定されるものではなく、例えば、時系列に沿って古いレセプトを含むレセプトグループから処理してもよいし、或は、新しいレセプトを含むレセプトグループから処理してもよい。 Regarding the first ID1 and the last ID1 in the receipt group, even if it is a first-to-last relationship (old ... first; new ... after) in time sequence from the oldest to the newer, It may be a chronological relationship (new ... first; old ... after) along the time series toward the older one.
In addition, the order of naming the plurality of reception groups is not particularly limited. For example, reception may be performed from a reception group including old receptions in time series, or reception including new receptions. You may process from a group.

（ＩＤ０の妥当性について）
図１７は、ＩＤ０とＩＤ１の名寄せ精度の比較グラフであり、具体的には、平成２５年度の１年分の患者（ＤＰＣ、医科入院、医科入院外、調剤）を対象に、ＩＤ０にて名寄せを実施し、従来のＩＤ１による名寄せ患者数及び平成２５年１０月の推計人口と比較した性年齢階級別の結果を示している。なお、（１）は男性、（２）は女性についての結果を示している。図１７（１）及び（２）に示すように、男女とも、ＩＤ０による性年齢階級別患者数はＩＤ１による患者数を下回っており、追加名寄せ率（ＩＤ１名寄せに比してＩＤ０名寄せで同一人物の特定に追加的に成功した割合）は男性で６．２％、女性で７．１％であった。ＩＤ１により名寄せされた患者数は、０〜９歳や７５〜７９歳、９０歳以上、男性の８５〜８９歳、女性の２５〜２９歳において推計人口を大きく上回っていた。一方、ＩＤ０により名寄せされた患者数は、男性の８５歳以上や女性の９０歳以上で推計人口を大きく上回ったが、それ以外の性年齢階級ではおおむね推計人口の範囲内に収まった。 (Validity of ID0)
FIG. 17 is a comparison graph of the identification accuracy of ID0 and ID1, and specifically, identification of ID0 for patients (DPC, medical admission, non-medical admission, dispensing) for the year 2013 The results for each sex age group compared to the number of name-identified patients by conventional ID1 and the estimated population in October 2013 are shown. Note that (1) shows the results for men and (2) shows the results for women. As shown in FIGS. 17 (1) and (2), the number of patients by gender age group by ID0 is lower than the number of patients by ID1 for both men and women, and the additional name identification rate (same person with ID0 grouping compared to ID1 grouping) The rate of additional successful identification) was 6.2% for men and 7.1% for women. The number of patients named by ID1 greatly exceeded the estimated population at 0-9 years, 75-79 years, 90 years and older, 85-89 years for men, and 25-29 years for women. On the other hand, the number of patients named by ID0 greatly exceeded the estimated population at the age of 85 and over for males and 90 and over for females, but in other sex age classes, it was generally within the estimated population.

図１８は、患者突合装置の説明図を示している。図１８に示すように、コンピュータ３には、患者突合装置４及び通信手段４５が備えられている。コンピュータ３とサーバ５は、通信手段４５により、ネットワーク６を介して有線又は無線で接続されている。また、サーバ５にはＮＤＢが設けられている。
患者突合装置４には、第１の名寄せ用中間テーブル生成手段４１、第２の名寄せ用中間テーブル生成手段４２、対応テーブル生成手段４３及び名寄せテーブル生成手段４４が設けられている。 FIG. 18 shows an explanatory diagram of the patient matching apparatus. As shown in FIG. 18, the computer 3 includes a patient matching device 4 and a communication unit 45. The computer 3 and the server 5 are connected to each other by a communication means 45 via a network 6 in a wired or wireless manner. The server 5 is provided with an NDB.
The patient matching apparatus 4 is provided with first name identification intermediate table generation means 41, second name identification intermediate table generation means 42, correspondence table generation means 43, and name identification table generation means 44.

患者突合装置４を稼動する際には、コンピュータ４は、通信手段４５を用いて、サーバ５からデータを受信する。コンピュータ４は、受信したデータを使用して、第１の名寄せ用中間テーブル生成手段４１により第１の名寄せ用中間テーブルを生成する。次に、第２の名寄せ用中間テーブル生成手段４２により第２の名寄せ用中間テーブルを生成する。生成したテーブルを対応テーブル生成手段４３により統合する。統合したテーブルを基に、名寄せテーブル生成手段４４により名寄せテーブルを生成する。
完成した名寄せテーブルは、通信手段４５により、ネットワーク６を介してサーバに送信される。これにより、サーバ５におけるＮＤＢは、より名寄せ精度の向上したデータベースとして利用可能となる。
サーバ５におけるＮＤＢは日々新しい情報に更新されるため、上記のような、患者突合装置４の稼動は、一度だけではなく、定期的に行うことが望ましい。頻繁に患者突合装置４を稼動することで、より名寄せ精度の向上したデータベースとすることが可能である。 When operating the patient matching apparatus 4, the computer 4 receives data from the server 5 using the communication means 45. The computer 4 uses the received data to generate a first name identification intermediate table by the first name identification intermediate table generation means 41. Next, the second name identification intermediate table generation means 42 generates a second name identification intermediate table. The generated table is integrated by the correspondence table generating means 43. A name identification table is generated by the name identification table generation unit 44 based on the integrated table.
The completed name identification table is transmitted to the server via the network 6 by the communication means 45. Thereby, the NDB in the server 5 can be used as a database with improved name identification accuracy.
Since the NDB in the server 5 is updated with new information every day, the operation of the patient matching device 4 as described above is desirably performed not only once but periodically. By frequently operating the patient matching apparatus 4, it is possible to make a database with improved name identification accuracy.

本発明は、レセプト情報データ等の効果的な活用を支援するツールとして有用である。 The present invention is useful as a tool for supporting effective utilization of receipt information data and the like.

３コンピュータ
４患者突合装置
５サーバ
６ネットワーク
４１第１の名寄せ用中間テーブル生成手段
４２第２の名寄せ用中間テーブル生成手段
４３対応テーブル生成手段
４４名寄せテーブル生成手段
４５通信手段 DESCRIPTION OF SYMBOLS 3 Computer 4 Patient matching apparatus 5 Server 6 Network 41 1st name identification intermediate table generation means 42 2nd name identification intermediate table generation means 43 Correspondence table generation means 44 Name identification table generation means 45 Communication means

Claims

A method of matching a receipt information database having at least two search keys, a first hash value based on an insurer number and a second hash value based on a name,
1) At least a first hash value, a second hash value, and a medical date are extracted using a receipt other than the dispensing receipt, and the first hash value and the second hash value over a plurality of months of the medical date Concatenating the receipts based on at least any match to generate a first name identification intermediate table for the same patient;
2) Using the dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and at least one of the first hash value and the second hash value over a plurality of months of the medical date Concatenating the receipts based on the match and generating a second name identification intermediate table for the same patient;
3) integrating a first name identification intermediate table and a second name identification intermediate table of the same patient along a time series of medical treatment dates, and generating a first hash value correspondence table for the same patient;
4) When there are a plurality of types of first hash values in the correspondence table, a step of generating a name collation table in which the plurality of types of first hash values are replaced with unique first hash values;
A patient collation method comprising: collating data in the database and collating patients.

In the first and second name identification intermediate table generation steps,
If both the first hash value and the second hash value match, they are linked as a receipt for the same patient, and from the first receipt occurrence month to the last receipt occurrence month, or from the first receipt occurrence month to the outcome category Concatenating as a single reception group until the month in which the death outcome is indicated,
When only one of the first hash value and the second hash value is matched in each of the receive groups, the first hash value matches the candidate for concatenation of the receive group whose connection target is the last receipt occurrence month, The first satisfying at least one of the closest in time series, the period does not overlap except for the same month, or the first receipt generation month of the same first or second hash value candidate is not the same Linking a receipt month with a receipt group to be consolidated;
The patient matching method according to claim 1, further comprising:

In the first and second name identification intermediate table generation steps,
If both the first hash value and the second hash value match, they are linked as a receipt for the same patient, and from the first receipt occurrence month to the last receipt occurrence month, or from the first receipt occurrence month to the outcome category Concatenating as a single reception group until the month in which the death outcome is indicated,
When only one of the first hash value and the second hash value is matched in each of the receive groups, the first hash value matches the candidate for concatenation of the receive group whose connection target is the last receipt occurrence month, Linked to a receipt group that satisfies all of the following conditions: the closest in time series, except for the same year and month, the periods do not overlap, and the first receipt generation month of the same first or second hash value candidate is not the same Step to do,
The patient matching method according to claim 1, further comprising:

4. The patient matching method according to claim 2 or 3, wherein a receipt group showing a death outcome is not connected to another receipt group having an initial month of receipt that exceeds three months from the last month. .

5. The method according to claim 2, wherein at least one of a medical institution location, a medical treatment start date, a medical treatment name, and / or a patient age group in the receipt information is connected. Patient matching method.

A patient matching program for causing a computer to execute each step in the patient matching method according to claim 1.

An apparatus for matching a receipt information database having at least two search keys of a first hash value based on an insurer number and a second hash value based on a name,
1) Enter a receipt other than the dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and the first hash value and the second hash value over a plurality of months of the medical date A first name identification intermediate table generating means for determining at least one of the matches, connecting the receipts of the same patient based on the determination, and generating a first name identification intermediate table;
2) Input a dispensing receipt, extract at least the first hash value, the second hash value, and the medical date, and at least one of the first hash value and the second hash value over a plurality of months of the medical date A second name identification intermediate table generating unit that generates a second name identification intermediate table by connecting the same patient's receipts based on the determination,
3) Correspondence table generation that integrates the first name identification intermediate table and the second name identification intermediate table of the same patient according to the time series of the medical treatment date and generates the first hash value correspondence table for the same patient. Means,
4) When there are a plurality of types of first hash values in the correspondence table, a name identification table generating unit that generates a name identification table in which the plurality of types of first hash values are replaced with unique first hash values;
A patient collating apparatus characterized by collating data in the database and identifying patients.