JP2020181568A

JP2020181568A - Information processing device, information processing method, and information processing program

Info

Publication number: JP2020181568A
Application number: JP2020016027A
Authority: JP
Inventors: 寿彦佐藤; Toshihiko Sato
Original assignee: PREC KK; Precision KK
Current assignee: PREC KK; Precision KK
Priority date: 2020-02-03
Filing date: 2020-02-03
Publication date: 2020-11-05

Abstract

To make it possible to accurately acquire answers written in a medical questionnaire.SOLUTION: A medical examination support device 1 according to one embodiment of the present invention comprises: a layout storage unit 121 to store a layout of a medical questionnaire in association with identification information for identifying the medical questionnaire; a candidate word storage unit 122 to store multiple candidate words; an entry content acquisition unit 111 to acquire the identification information from an image of the medical questionnaire and, on the basis of the layout associated with the acquired identification information, acquire one or more characters included in the image; a narrowing unit 112 to narrow the multiple candidate words on the basis of positions where the one or more characters have been acquired; an answer specification unit 113 to specify answers written in the medical questionnaire by comparing the one or more characters acquired by the entry content acquisition unit 111 to the multiple candidate words narrowed by the narrowing unit 112; and an output unit 114 to output the answers specified by the answer specification unit 113.SELECTED DRAWING: Figure 4

Description

本発明は、問診票に記入された回答を取得する情報処理装置、情報処理方法及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program for acquiring an answer entered in a questionnaire.

スキャナに紙を読み込ませて画像データを取得し、画像データに対してＯＣＲ（Optical Character Recognition）処理等の文字認識処理を行うことによって、紙に記入された内容を取得するシステムが知られている（例えば、特許文献１参照）。このようなシステムは、紙に記入された内容を、記憶装置に記憶したり、表示装置へ表示したりすることによって、その後の処理を円滑にすることができる。 A system is known in which a scanner reads paper to acquire image data, and then performs character recognition processing such as OCR (Optical Character Recognition) processing on the image data to acquire the contents written on the paper. (See, for example, Patent Document 1). Such a system can facilitate subsequent processing by storing the contents written on the paper in a storage device or displaying them on a display device.

特開２０１２−９０００号公報Japanese Unexamined Patent Publication No. 2012-9000

ところで、患者（受診者）が病院で診療を受ける際には、問診票（質問票ともいう）に自身の症状等の回答を記入する。文字認識処理によって問診票に記入された回答を取得する際に、文字認識処理の誤認識による誤りがある場合や、患者が回答を記入する際に誤字や脱字等の誤りをしている場合がある。特に、医療現場では、患者が障害を持つために記入される文字が粗い場合があり、また高齢者が多いために患者自身がうろ覚えで薬剤名を覚えているため記入される薬剤名が誤っている場合がある。医者や看護師等の医療従事者が、文字認識処理によって取得された回答を参照する際に、このような誤りを手作業で修正するには負担が掛かる。 By the way, when a patient (examinee) receives medical treatment at a hospital, he / she fills in an answer such as his / her symptoms on a questionnaire (also called a questionnaire). When obtaining the answer entered in the questionnaire by the character recognition process, there may be an error due to misrecognition of the character recognition process, or the patient may make an error such as a typographical error or omission when entering the answer. is there. In particular, in the medical field, the characters entered may be rough because the patient has a disability, and because there are many elderly people, the patient himself remembers the drug name, so the drug name entered is incorrect. There may be. When medical professionals such as doctors and nurses refer to the answers obtained by the character recognition process, it is burdensome to manually correct such errors.

本発明は上述の点に鑑みてなされたものであり、問診票に記入された回答を高精度に取得できる情報処理装置、情報処理方法及び情報処理プログラムを提供することを目的とする。 The present invention has been made in view of the above points, and an object of the present invention is to provide an information processing device, an information processing method, and an information processing program capable of obtaining answers entered in a questionnaire with high accuracy.

本発明の第１の態様の情報処理装置は、問診票のレイアウトを、前記問診票を識別する識別情報に関連付けて記憶するレイアウト記憶部と、複数の候補単語を記憶する候補単語記憶部と、前記問診票の画像から前記識別情報を取得し、取得した前記識別情報に関連付けられた前記レイアウトに基づいて、前記画像に含まれる一以上の文字を取得する取得部と、前記一以上の文字が取得された位置に基づいて、前記複数の候補単語を絞り込む絞込部と、前記取得部が取得した前記一以上の文字と前記絞込部が絞り込んだ前記複数の候補単語とを比較することによって、前記問診票に記入された回答を特定する特定部と、前記特定部が特定した前記回答を出力する出力部と、を有する。 The information processing device according to the first aspect of the present invention includes a layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, a candidate word storage unit that stores a plurality of candidate words, and the like. An acquisition unit that acquires the identification information from the image of the questionnaire and acquires one or more characters included in the image based on the layout associated with the acquired identification information, and the one or more characters. By comparing the narrowing-down unit that narrows down the plurality of candidate words based on the acquired positions with the one or more characters acquired by the acquisition unit and the plurality of candidate words narrowed down by the narrowing-down unit. , A specific unit that specifies the answer entered in the questionnaire, and an output unit that outputs the answer specified by the specific unit.

前記特定部は、前記取得部が取得した前記一以上の文字と前記絞込部が絞り込んだ前記複数の候補単語それぞれとの間の類似度を算出し、算出した前記類似度に基づいて選択した前記複数の候補単語のうち１つを前記回答として特定してもよい。 The specific unit calculates the similarity between the one or more characters acquired by the acquisition unit and each of the plurality of candidate words narrowed down by the narrowing unit, and selects based on the calculated similarity. One of the plurality of candidate words may be specified as the answer.

前記情報処理装置は、２つの文字の間の類似性を示す値を記憶する類似情報記憶部をさらに有し、前記特定部は、前記取得部が取得した前記一以上の文字に含まれる１つの文字又は複数の文字の組み合わせと、前記複数の候補単語それぞれに含まれる１つの文字又は複数の文字の組み合わせとの間の前記値を取得し、取得した前記値を用いて前記類似度を算出してもよい。 The information processing device further has a similar information storage unit that stores a value indicating similarity between two characters, and the specific unit is included in one or more characters acquired by the acquisition unit. The value between a character or a combination of a plurality of characters and one character or a combination of a plurality of characters included in each of the plurality of candidate words is acquired, and the similarity is calculated using the acquired value. You may.

前記レイアウト記憶部は、所定位置を基準とした前記問診票に含まれる回答欄の位置を示す前記レイアウトを記憶し、前記取得部は、前記画像の中で前記レイアウトが示す前記回答欄の位置にある前記一以上の文字を取得してもよい。 The layout storage unit stores the layout indicating the position of the answer column included in the questionnaire based on the predetermined position, and the acquisition unit is located at the position of the answer column indicated by the layout in the image. One or more characters may be acquired.

前記候補単語記憶部は、回答欄に関連付けて前記複数の候補単語を記憶し、前記絞込部は、前記候補単語記憶部から前記一以上の文字が取得された前記回答欄に関連付けられた前記複数の候補単語を取得することによって、前記複数の候補単語を絞り込んでもよい。 The candidate word storage unit stores the plurality of candidate words in association with the answer column, and the narrowing unit is associated with the answer column in which one or more characters are acquired from the candidate word storage unit. The plurality of candidate words may be narrowed down by acquiring a plurality of candidate words.

前記候補単語記憶部は、医学用語に関連付けて前記複数の候補単語を記憶し、前記取得部は、前記一以上の文字に加えて、前記画像において前記一以上の文字が取得された前記回答欄に関連付けられた前記医学用語を取得し、前記絞込部は、前記候補単語記憶部から前記取得部が取得した前記医学用語に関連付けられた前記複数の候補単語を取得することによって、前記複数の候補単語を絞り込み、前記出力部は、前記回答と前記医学用語とを関連付けて出力してもよい。 The candidate word storage unit stores the plurality of candidate words in association with medical terms, and the acquisition unit acquires the one or more characters in the image in addition to the one or more characters. By acquiring the medical term associated with the medical term, and acquiring the plurality of candidate words associated with the medical term acquired by the acquisition unit from the candidate word storage unit, the narrowing unit obtains the plurality of candidate words. Candidate words may be narrowed down, and the output unit may output the answer in association with the medical term.

前記レイアウト記憶部は、前記医学用語に関連付けて前記問診票に記載された質問を記憶し、前記出力部は、前記回答及び前記医学用語に加えて、前記質問を出力してもよい。 The layout storage unit may store the question described in the questionnaire in association with the medical term, and the output unit may output the question in addition to the answer and the medical term.

前記絞込部は、前記特定部が前記一以上の文字が取得された位置の第１の回答欄について特定した前記回答に基づいて、前記第１の回答欄とは異なる第２の回答欄について、前記複数の候補単語を絞り込んでもよい。 The narrowing section is based on the answer specified by the specific section for the first answer column at the position where the one or more characters are acquired, and the second answer column different from the first answer column. , The plurality of candidate words may be narrowed down.

前記候補単語記憶部は、前記複数の候補単語それぞれを、１つの単語の状態と、１つの単語を複数の部分に分割した状態と、複数の単語を結合した状態とのうち少なくとも１つの状態で記憶してもよい。 The candidate word storage unit is in at least one state of each of the plurality of candidate words, one word state, one word divided into a plurality of parts, and a state in which a plurality of words are combined. You may remember.

前記取得部は、前記画像に含まれる所定の２つの記号を結ぶ線分を特定し、前記線分の長さに所定の係数を乗算することによって前記問診票の中で前記回答が記入される可能性のある領域を区画する辺の長さを算出し、前記長さに基づいて前記画像を補正してもよい。 The acquisition unit identifies a line segment connecting two predetermined symbols included in the image, and the answer is entered in the questionnaire by multiplying the length of the line segment by a predetermined coefficient. The length of the sides that partition the potential region may be calculated and the image corrected based on the length.

本発明の第２の態様の情報処理方法は、問診票のレイアウトを、前記問診票を識別する識別情報に関連付けて記憶するレイアウト記憶部と、複数の候補単語を記憶する候補単語記憶部と、を有するコンピュータが、前記問診票の画像から前記識別情報を取得し、取得した前記識別情報に関連付けられた前記レイアウトに基づいて、前記画像に含まれる一以上の文字を取得するステップと、前記一以上の文字が取得された位置に基づいて、前記複数の候補単語を絞り込むステップと、前記取得するステップが取得した前記一以上の文字と前記絞り込むステップが絞り込んだ前記複数の候補単語とを比較することによって、前記問診票に記入された回答を特定するステップと、前記特定するステップが特定した前記回答を出力するステップと、を有する。 In the information processing method of the second aspect of the present invention, a layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, a candidate word storage unit that stores a plurality of candidate words, and a candidate word storage unit. A step of acquiring the identification information from the image of the questionnaire and acquiring one or more characters included in the image based on the layout associated with the acquired identification information. Based on the position where the above characters are acquired, the step of narrowing down the plurality of candidate words is compared with the one or more characters acquired by the acquisition step and the plurality of candidate words narrowed down by the narrowing down step. Thereby, it has a step of specifying the answer entered in the questionnaire and a step of outputting the answer specified by the specifying step.

本発明の第３の態様の情報処理プログラムは、問診票のレイアウトを、前記問診票を識別する識別情報に関連付けて記憶するレイアウト記憶部と、複数の候補単語を記憶する候補単語記憶部と、を有するコンピュータに、前記問診票の画像から前記識別情報を取得し、取得した前記識別情報に関連付けられた前記レイアウトに基づいて、前記画像に含まれる一以上の文字を取得するステップと、前記一以上の文字が取得された位置に基づいて、前記複数の候補単語を絞り込むステップと、前記取得するステップが取得した前記一以上の文字と前記絞り込むステップが絞り込んだ前記複数の候補単語とを比較することによって、前記問診票に記入された回答を特定するステップと、前記特定するステップが特定した前記回答を出力するステップと、を実行させる。 The information processing program according to the third aspect of the present invention includes a layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, a candidate word storage unit that stores a plurality of candidate words, and a candidate word storage unit that stores a plurality of candidate words. The step of acquiring the identification information from the image of the questionnaire and acquiring one or more characters included in the image based on the layout associated with the acquired identification information, and the above-mentioned one. Based on the position where the above characters are acquired, the step of narrowing down the plurality of candidate words is compared with the one or more characters acquired by the acquisition step and the plurality of candidate words narrowed down by the narrowing down step. By doing so, the step of specifying the answer entered in the questionnaire and the step of outputting the answer specified by the specifying step are executed.

本発明によれば、問診票に記入された回答を高精度に取得できるという効果を奏する。 According to the present invention, there is an effect that the answer entered in the questionnaire can be obtained with high accuracy.

実施形態に係る診療支援システムの模式図である。It is a schematic diagram of the medical care support system which concerns on embodiment. 実施形態に係る診療支援システムが実行する診療支援方法の模式図である。It is a schematic diagram of the medical care support method executed by the medical care support system which concerns on embodiment. 画像から取得した文字に基づいて回答を特定する方法の模式図である。It is a schematic diagram of the method of specifying an answer based on the character acquired from an image. 実施形態に係る診療支援システムのブロック図である。It is a block diagram of the medical care support system which concerns on embodiment. 例示的な問診票の正面図である。It is a front view of an exemplary questionnaire. 記入内容取得部が回答領域の位置を特定する方法の模式図である。It is a schematic diagram of the method in which the entry content acquisition part specifies the position of the answer area. 絞込部による絞込処理の模式図である。It is a schematic diagram of the narrowing down process by a narrowing down part. 文字間距離を示すマトリクスの模式図である。It is a schematic diagram of the matrix which shows the distance between characters. 診療用端末の表示部の正面図である。It is a front view of the display part of a medical treatment terminal. 実施形態に係る診療支援方法のフローチャートを示す図である。It is a figure which shows the flowchart of the medical care support method which concerns on embodiment.

［診療支援システムＳの概要］
図１は、本実施形態に係る診療支援システムＳの模式図である。診療支援システムＳは、診療支援装置１と、診療用端末２と、読取装置３とを含む。診療支援システムＳは、その他のサーバ、端末等の機器を含んでもよい。 [Outline of medical care support system S]
FIG. 1 is a schematic diagram of a medical care support system S according to the present embodiment. The medical care support system S includes a medical care support device 1, a medical care terminal 2, and a reading device 3. The medical care support system S may include other devices such as servers and terminals.

診療支援装置１（情報処理装置）は、問診票３１（質問票）に記入された患者の回答を取得し、該回答を診療用端末２へ提供するコンピュータである。問診票３１は、ステープラ等の固定具Ｆによって束ねられた一組の紙であり、それぞれの紙は、患者への質問と、患者が質問に対する回答を記入するための回答欄とを含む。 The medical care support device 1 (information processing device) is a computer that acquires the patient's answer entered in the questionnaire 31 (questionnaire) and provides the answer to the medical care terminal 2. The questionnaire 31 is a set of papers bundled by a fixture F such as a stapler, and each paper contains a question to the patient and an answer column for the patient to enter an answer to the question.

診療用端末２は、医師、看護師等の医療従事者に対して情報を表示するとともに、医療従事者による入力を受け付けるコンピュータであり、例えば表示部２１及び操作部２２を有するパーソナルコンピュータである。表示部２１は、液晶ディスプレイ等の表示装置を含む。操作部２２は、キーボード、マウス等の操作装置を含む。診療用端末２は、インターネット、ローカルエリアネットワーク等のネットワークＮを介して、診療支援装置１から表示のための情報を受信するとともに、診療支援装置１へ医療従事者による入力を送信する。 The medical care terminal 2 is a computer that displays information to medical staff such as doctors and nurses and accepts input by the medical staff, for example, a personal computer having a display unit 21 and an operation unit 22. The display unit 21 includes a display device such as a liquid crystal display. The operation unit 22 includes an operation device such as a keyboard and a mouse. The medical care terminal 2 receives information for display from the medical care support device 1 via a network N such as the Internet or a local area network, and transmits input by a medical worker to the medical care support device 1.

読取装置３は、紙の問診票３１の表面を画像として読み取るスキャナである。読取装置３は、紙の表面を読み取って画像データを出力可能なＣＣＤ（Charge Coupled Device）、ＣＭＯＳ（Complementary Metal Oxide Semiconductor）等の撮像素子を含む。読取装置３は、ネットワークＮを介して、診療支援装置１へ問診票３１の画像を送信する。 The reading device 3 is a scanner that reads the surface of the paper questionnaire 31 as an image. The reading device 3 includes an image pickup device such as a CCD (Charge Coupled Device) or a CMOS (Complementary Metal Oxide Semiconductor) capable of reading the surface of paper and outputting image data. The reading device 3 transmits the image of the questionnaire 31 to the medical care support device 1 via the network N.

読取装置３は、デジタルカメラや、カメラを有するスマートフォン等の携帯端末であってもよい。この場合には、医療従事者又は患者は、デジタルカメラ又は携帯端末を用いて問診票３１の正面を撮像することによって、問診票３１の表面を画像として読み取る。 The reading device 3 may be a digital camera or a mobile terminal such as a smartphone having a camera. In this case, the medical staff or the patient reads the surface of the questionnaire 31 as an image by photographing the front surface of the questionnaire 31 using a digital camera or a mobile terminal.

診療支援システムＳは、読取装置３に代えて又は加えて、患者が問診票への回答を入力するための患者用端末（例えばタブレット端末）を有してもよい。この場合に、患者は、電子的な問診票を表示している患者用端末に対して指又はタッチペンを用いて手書き入力をすることによって回答を記入する。患者用端末は、ネットワークＮを介して、患者が電子的な問診票３１上で行った手書き入力の情報を、問診票３１の画像として診療支援装置１へ送信する。 The medical care support system S may have a patient terminal (for example, a tablet terminal) for the patient to input an answer to the questionnaire in place of or in addition to the reading device 3. In this case, the patient fills in the answer by handwriting the patient terminal displaying the electronic questionnaire using a finger or a stylus. The patient terminal transmits the information of the handwritten input performed by the patient on the electronic questionnaire 31 to the medical care support device 1 as an image of the questionnaire 31 via the network N.

図２は、本実施形態に係る診療支援システムＳが実行する診療支援方法（情報処理方法）の模式図である。まず患者は、問診票３１に、問診票３１の質問への回答を記入する。医療従事者又は患者は、問診票３１のうち固定具Ｆを含む一部を切断し、問診票３１を１枚ずつの紙に分ける（ａ）。そして医療従事者又は患者は、読取装置３に問診票３１を読み取らせる（ｂ）。読取装置３は、問診票３１の画像を診療支援装置１へ送信する。 FIG. 2 is a schematic diagram of a medical care support method (information processing method) executed by the medical care support system S according to the present embodiment. First, the patient fills in the answer to the question on the questionnaire 31 on the questionnaire 31. The medical staff or the patient cuts a part of the questionnaire 31 including the fixture F and divides the questionnaire 31 into sheets of paper (a). Then, the medical staff or the patient causes the reading device 3 to read the questionnaire 31 (b). The reading device 3 transmits the image of the questionnaire 31 to the medical care support device 1.

次に診療支援装置１は、問診票３１の画像に基づいて、問診票３１を特定する。問診票３１は、問診票３１（問診票３１の種類）ごとに異なる質問と回答欄を含んでいる。問診票３１は、自身の識別情報を示すコード（例えば二次元バーコード）を表している。診療支援装置１は、画像から該コードを検出することによって問診票３１を特定できる。 Next, the medical care support device 1 identifies the questionnaire 31 based on the image of the questionnaire 31. The questionnaire 31 includes different question and answer columns for each questionnaire 31 (type of questionnaire 31). The questionnaire 31 represents a code (for example, a two-dimensional bar code) indicating its own identification information. The medical care support device 1 can identify the questionnaire 31 by detecting the code from the image.

次に診療支援装置１は、特定した問診票３１のレイアウト情報を記憶部から取得する。診療支援装置１の記憶部は、問診票３１のレイアウト情報を予め記憶している。レイアウト情報は、問診票３１に含まれる各回答欄の位置を示す。そして診療支援装置１は、画像上でレイアウト情報が示す回答欄の位置を含む所定範囲に対してＯＣＲ処理等の文字認識処理を行うことによって、画像に含まれる文字を取得する。診療支援装置１は、回答欄ごとに取得された一以上の文字を、一群の文字として回答欄に関連付けて管理する。次に診療支援装置１は、文字認識処理によって画像から取得した文字に基づいて回答を特定する（ｃ）。そして診療支援装置１は、特定した回答を記憶部に記憶させ、また表示部に表示させる。 Next, the medical care support device 1 acquires the layout information of the specified questionnaire 31 from the storage unit. The storage unit of the medical care support device 1 stores the layout information of the questionnaire 31 in advance. The layout information indicates the position of each answer column included in the questionnaire 31. Then, the medical care support device 1 acquires characters included in the image by performing character recognition processing such as OCR processing on a predetermined range including the position of the answer column indicated by the layout information on the image. The medical care support device 1 manages one or more characters acquired for each answer column in association with the answer column as a group of characters. Next, the medical care support device 1 identifies the answer based on the characters acquired from the image by the character recognition process (c). Then, the medical care support device 1 stores the specified answer in the storage unit and displays it on the display unit.

図３は、画像から取得した文字に基づいて回答を特定する方法の模式図である。診療支援装置１は、文字認識処理で一群の文字が取得された位置（すなわち一群の文字がいずれの回答欄に記入されていたか）に基づいて、記憶部に記憶された候補単語を絞り込む（ｃ１）。診療支援装置１の記憶部は、問診票３１に記入される可能性のある複数の候補単語を予め記憶している。 FIG. 3 is a schematic diagram of a method of identifying an answer based on characters obtained from an image. The medical care support device 1 narrows down the candidate words stored in the storage unit based on the position where the group of characters was acquired by the character recognition process (that is, which answer column the group of characters was entered in) (c1). ). The storage unit of the medical care support device 1 stores in advance a plurality of candidate words that may be entered in the questionnaire 31.

問診票３１に記載された質問によって、質問に対応する回答欄に記入される可能性のある候補単語が異なる。例えば時間についての質問であれば回答欄には時間を表す単語が記入され、薬剤についての質問であれば回答欄には薬剤名等の単語が記入される。そのため、診療支援装置１は、一群の文字がいずれの回答欄に記入されていたかに基づいて、候補単語を絞り込む。このように診療支援装置１は、一群の文字が取得された位置（回答欄）に記入される可能性のある候補単語を絞り込むため、回答を特定するのに掛かる処理時間を短縮できるとともに、回答を特定する精度を向上できる。このとき、診療支援装置１は、回答欄の位置ごとに予め学習された学習結果に基づいて、候補単語を絞り込む。診療支援装置１は、回答欄ごとに用意された学習データ及び教師データを用いて機械学習を行う。診療支援装置１の記憶部は、機械学習の学習結果を予め記憶している。診療支援装置１は、問診票から読み取られた文字と、文字が取得された位置（回答欄）と、記憶部に記憶されている学習結果とに基づいて、候補単語を絞り込む。このように診療支援装置１は、回答欄ごとに行われた機械学習の学習結果を用いて、画像から取得された一群の文字が示す可能性のある候補単語を絞り込むため、回答を特定するのに掛かる処理時間を短縮できるとともに、回答を特定する精度を向上できる。 Depending on the question written on the questionnaire 31, the candidate words that may be entered in the answer column corresponding to the question differ. For example, in the case of a question about time, a word indicating time is entered in the answer column, and in the case of a question about a drug, a word such as a drug name is entered in the answer column. Therefore, the medical care support device 1 narrows down the candidate words based on which answer column contains the group of characters. In this way, the medical care support device 1 narrows down the candidate words that may be entered in the position (answer column) where the group of characters is acquired, so that the processing time required to identify the answer can be shortened and the answer can be answered. The accuracy of identifying is improved. At this time, the medical care support device 1 narrows down the candidate words based on the learning results learned in advance for each position in the answer column. The medical care support device 1 performs machine learning using the learning data and the teacher data prepared for each answer column. The storage unit of the medical care support device 1 stores the learning result of machine learning in advance. The medical care support device 1 narrows down candidate words based on the characters read from the questionnaire, the position where the characters are acquired (answer column), and the learning result stored in the storage unit. In this way, the medical care support device 1 identifies the answer in order to narrow down the candidate words that the group of characters acquired from the image may indicate by using the learning result of the machine learning performed for each answer column. It is possible to shorten the processing time required for the process and improve the accuracy of identifying the answer.

最後に診療支援装置１は、問診票から読み取られた一群の文字に含まれる各文字と絞り込まれた候補単語それぞれに含まれる各文字との間の文字間距離に基づいて類似度を算出し、算出した類似度を用いて回答を特定する（ｃ２）。文字間距離は、２つの文字の間の外観上の類似性を示す値である。診療支援装置１の記憶部は、文字の各組み合わせの文字間距離を示すマトリクス（行列）を予め記憶している。なお、絞り込みは必須のプロセスではなく、診療支援装置１は、絞り込みの際の評価で、絞り込みの候補の類似度が所定の条件を満たさない場合には絞り込みを行わずに、候補単語をそのまま回答として特定してもよい。 Finally, the medical care support device 1 calculates the similarity based on the inter-character distance between each character included in the group of characters read from the questionnaire and each character contained in each of the narrowed down candidate words. The answer is identified using the calculated similarity (c2). The inter-character distance is a value indicating the appearance similarity between two characters. The storage unit of the medical care support device 1 stores in advance a matrix indicating the distance between characters for each combination of characters. It should be noted that narrowing down is not an indispensable process, and the medical care support device 1 answers the candidate words as they are without narrowing down if the similarity of the narrowing down candidates does not meet the predetermined conditions in the evaluation at the time of narrowing down. May be specified as.

診療支援装置１は、問診票から読み取られた一群の文字に含まれる各文字と絞り込まれた候補単語それぞれに含まれる各文字との間の文字間距離に基づいて類似度を算出し、算出した類似度が所定の条件を満たす候補単語を回答として特定する。このように診療支援装置１は、文字同士の類似性を考慮して候補単語の中から画像から取得された一群の文字に対応する回答を特定するため、問診票に記入された回答を高精度に特定できる。 The medical care support device 1 calculated and calculated the similarity based on the inter-character distance between each character included in a group of characters read from the questionnaire and each character included in each of the narrowed down candidate words. A candidate word whose similarity satisfies a predetermined condition is specified as an answer. In this way, the medical care support device 1 identifies the answer corresponding to the group of characters acquired from the image from the candidate words in consideration of the similarity between the characters, so that the answer entered in the questionnaire is highly accurate. Can be specified.

以上のように、本実施形態に係る診療支援システムＳは、問診票の画像に対する文字認識処理の誤認識による誤りがある場合や、患者が回答を記入する際に誤りをした場合であっても、問診票に記入された回答を高精度にかつ短い処理時間で取得できる。 As described above, the medical care support system S according to the present embodiment has an error due to an erroneous recognition of the character recognition process for the image of the questionnaire, or even if the patient makes an error when entering the answer. , The answers entered in the questionnaire can be obtained with high accuracy and in a short processing time.

［診療支援システムＳの構成］
図４は、本実施形態に係る診療支援システムＳのブロック図である。図４において、矢印は主なデータの流れを示しており、図４に示したもの以外のデータの流れがあってよい。図４において、各ブロックはハードウェア（装置）単位の構成ではなく、機能単位の構成を示している。そのため、図４に示すブロックは単一の装置内に実装されてよく、あるいは複数の装置内に分かれて実装されてよい。ブロック間のデータの授受は、データバス、ネットワーク、可搬記憶媒体等、任意の手段を介して行われてよい。 [Configuration of medical care support system S]
FIG. 4 is a block diagram of the medical care support system S according to the present embodiment. In FIG. 4, the arrows indicate the main data flows, and there may be data flows other than those shown in FIG. In FIG. 4, each block shows a functional unit configuration, not a hardware (device) unit configuration. Therefore, the block shown in FIG. 4 may be mounted in a single device, or may be mounted separately in a plurality of devices. Data transfer between blocks may be performed via an arbitrary means such as a data bus, a network, or a portable storage medium.

診療支援装置１（情報処理装置）は、制御部１１と、記憶部１２とを有する。制御部１１は、記入内容取得部１１１と、絞込部１１２と、回答特定部１１３と、出力部１１４とを有する。記憶部１２は、レイアウト記憶部１２１と、候補単語記憶部１２２と、類似情報記憶部１２３と、回答記憶部１２４とを有する。 The medical care support device 1 (information processing device) has a control unit 11 and a storage unit 12. The control unit 11 includes an entry content acquisition unit 111, a narrowing unit 112, a response identification unit 113, and an output unit 114. The storage unit 12 includes a layout storage unit 121, a candidate word storage unit 122, a similar information storage unit 123, and an answer storage unit 124.

記憶部１２は、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）、ハードディスクドライブ等を含む記憶媒体である。記憶部１２は、制御部１１が実行するプログラムを予め記憶する。レイアウト記憶部１２１、候補単語記憶部１２２、類似情報記憶部１２３及び回答記憶部１２４は、それぞれ記憶部１２上の記憶領域であってもよく、あるいは記憶部１２上で構成されたデータベースであってもよい。 The storage unit 12 is a storage medium including a ROM (Read Only Memory), a RAM (Random Access Memory), a hard disk drive, and the like. The storage unit 12 stores in advance the program executed by the control unit 11. The layout storage unit 121, the candidate word storage unit 122, the similar information storage unit 123, and the answer storage unit 124 may each be a storage area on the storage unit 12, or are a database configured on the storage unit 12. May be good.

レイアウト記憶部１２１は、問診票３１に含まれる各回答欄の位置を示すレイアウト情報を、問診票３１の識別情報に関連付けて予め記憶している。候補単語記憶部１２２は、問診票３１に記入される可能性のある複数の候補単語を予め記憶している。類似情報記憶部１２３は、文字の各組み合わせの文字間距離を示すマトリクスを類似情報として予め記憶している。回答記憶部１２４は、診療支援装置１が特定した患者の回答を、患者の識別情報に関連付けて記憶する。 The layout storage unit 121 stores in advance layout information indicating the position of each answer column included in the questionnaire 31 in association with the identification information of the questionnaire 31. The candidate word storage unit 122 stores in advance a plurality of candidate words that may be entered in the questionnaire 31. The similar information storage unit 123 stores in advance a matrix indicating the inter-character distance of each combination of characters as similar information. The answer storage unit 124 stores the patient's answer identified by the medical care support device 1 in association with the patient's identification information.

制御部１１は、例えばＣＰＵ（Central Processing Unit）等のプロセッサであり、記憶部１２に記憶されたプログラムを実行することにより、記入内容取得部１１１、絞込部１１２、回答特定部１１３及び出力部１１４として機能する。制御部１１の機能の少なくとも一部は、電気回路によって実行されてもよい。また、制御部１１の機能の少なくとも一部は、ネットワーク経由で実行されるプログラムによって実行されてもよい。 The control unit 11 is, for example, a processor such as a CPU (Central Processing Unit), and by executing a program stored in the storage unit 12, the entry content acquisition unit 111, the narrowing unit 112, the answer identification unit 113, and the output unit Functions as 114. At least some of the functions of the control unit 11 may be performed by an electric circuit. Further, at least a part of the functions of the control unit 11 may be executed by a program executed via the network.

本実施形態に係る診療支援システムＳは、図４に示す具体的な構成に限定されない。例えば診療支援装置１及び診療用端末２は、一体化されて１つの装置として構成されてもよい。 The medical care support system S according to the present embodiment is not limited to the specific configuration shown in FIG. For example, the medical care support device 1 and the medical care terminal 2 may be integrated into one device.

［問診票３１から文字を取得する処理の説明］
診療支援システムＳが実行する診療支援方法（情報処理方法）のうち、問診票３１から文字を取得する処理を以下に説明する。図５は、例示的な問診票３１の正面図である。問診票３１は、回答領域３１１を表している。問診票３１は、それぞれ異なる回答領域３１１を表している複数の紙を含んでおり、複数の紙はステープラ等の固定具Ｆによって束ねられている。問診票３１が一枚の紙である場合には、固定具Ｆは省略されてもよい。 [Explanation of the process of acquiring characters from the questionnaire 31]
Among the medical treatment support methods (information processing methods) executed by the medical treatment support system S, the process of acquiring characters from the questionnaire 31 will be described below. FIG. 5 is a front view of an exemplary questionnaire 31. Questionnaire 31 represents the answer area 311. The questionnaire 31 includes a plurality of papers each representing a different answer area 311. The plurality of papers are bundled by a fixture F such as a stapler. If the questionnaire 31 is a sheet of paper, the fixture F may be omitted.

回答領域３１１は、１つ以上の質問３１２と、患者が質問３１２に対する回答を記入するための１つ以上の回答欄３１３と、質問３１２及び回答欄３１３に関連付けられた医学用語の見出し３１４とを含む。回答領域３１１は、１つの質問３１２に対して１つの回答欄３１３を含んでもよく、複数の回答欄３１３を含んでもよい。回答領域３１１は、患者に限らず、医療従事者が情報を記入するための欄を含んでもよい。 The answer area 311 contains one or more questions 312, one or more answer columns 313 for the patient to fill in the answer to the question 312, and the medical term headings 314 associated with the questions 312 and answer columns 313. Including. The answer area 311 may include one answer column 313 for one question 312, or may include a plurality of answer columns 313. The response area 311 is not limited to the patient, and may include a column for the medical staff to enter information.

質問３１２は、患者に対する質問を表す文字列である。回答欄３１３は、患者が質問３１２に対する回答を記入するための欄である。回答欄３１３は、患者が回答内容を選択するチェックボックス等の選択欄であってもよく、あるいは患者が回答内容を表す文字を記入する自由記入欄であってもよい。 Question 312 is a character string representing a question to the patient. The answer column 313 is a column for the patient to enter an answer to the question 312. The answer column 313 may be a selection column such as a check box in which the patient selects the answer content, or may be a free entry field in which the patient enters characters representing the answer content.

見出し３１４は、質問３１２及び回答欄３１３の内容を表す、医師が解釈する医学用語である。見出し３１４は、質問３１２及び回答欄３１３の近傍に配置されている。患者が理解できる言葉と医学用語との間には乖離があることが多く、質問３１２に医学用語を記載すると患者が理解できない可能性がある。そのため、質問３１２は平易な表現で記載されることが多い。これにより、医療従事者は、質問３１２及び回答欄３１３の内容を医学的に理解する際に、平易な表現から医学用語へ変換する必要があるため時間が掛かる。 Heading 314 is a physician-interpreted medical term that describes the content of question 312 and answer column 313. Heading 314 is located near question 312 and answer column 313. There is often a discrepancy between the words that the patient can understand and the medical terms, and if the medical terms are listed in Question 312, the patient may not understand them. Therefore, question 312 is often written in plain language. As a result, it takes time for the medical staff to convert the simple expressions into medical terms when medically understanding the contents of the question 312 and the answer column 313.

そこで問診票３１が質問３１２及び回答欄３１３に関連付けられた医学用語の見出し３１４を表すことによって、医療従事者は、平易な表現から医学用語へ変換する必要がなくなり、質問３１２及び回答欄３１３の内容を医学的に理解する時間を短縮できる。例えば「既往歴」という医学用語に対応する質問３１２は「今までご病気をされたことがありますか？」という表記となるが、医療従事者にとっては「既往歴」という文字列の方が短く一瞬で内容が理解されるメリットがある。また、回答欄３１３を医学用語に関連付けることにより、医療従事者は回答欄３１３の記入が正しく行われているのかのチェックがすばやくできる。 Therefore, by displaying the medical term heading 314 associated with the question 312 and the answer column 313 on the questionnaire 31, the medical staff does not need to convert the plain expression into the medical term, and the question 312 and the answer column 313 do not need to be converted. You can shorten the time to understand the contents medically. For example, Question 312, which corresponds to the medical term "history", is written as "Have you ever been ill?", But for healthcare professionals, the string "history" is shorter. There is a merit that the contents can be understood in an instant. Further, by associating the answer column 313 with the medical terminology, the medical staff can quickly check whether the answer column 313 is entered correctly.

また、「薬剤歴」という医学用語に対応する質問３１２は「現在飲まれている薬剤はなんですか？」という表記となるが、回答欄３１３には薬剤名が記載されることになる。日本で発売されている薬剤は約２０万種類程度に限られている。そのため、後述のように回答欄３１３に関連付けられた「薬剤歴」の回答欄３１３に関連付けて、薬剤名の候補単語を登録しておき、また頻度情報を学習に入れておくことにより、解を絞り込むことが可能になり、回答を特定するための精度を上げることができ、回答を特定するのに掛かる処理時間も短縮できる。同時に、問診票のスキャン結果や問診票を見るときに「薬剤歴」という短い医学用語が記載されていることで、医療従事者は一瞬でどこにどのような情報があるのかが理解できる。特に薬剤名にはカタカナが多いため、文字認識処理において、ソ（そ）とン（ん）、シ（し）とツ（つ）、ツ（大文字）とッ（小文字）、パ（ぱ）とバ（ば）等、誤認識が発生しやすい。従来これらの手書き入力の精度を向上させることは難しく、薬剤名を間違えることは即患者さんの生死に関わることになるため、文字認識処理の精度を十分に上げ、かつ医療従事者が直感的に理解できるようになることは大事なことであった。本実施形態に係る診療支援システムＳは、医学単語の知覚の記載であるため医療従事者が直感的にチェックを行うことを可能にすると同時に、回答を特定する精度を向上させることができる。 In addition, the question 312 corresponding to the medical term "drug history" is written as "what drug is currently being taken?", But the drug name is described in the answer column 313. The number of drugs on the market in Japan is limited to about 200,000. Therefore, as described later, the solution is solved by registering the candidate word of the drug name in association with the answer column 313 of the "drug history" associated with the answer column 313 and incorporating the frequency information into the learning. It becomes possible to narrow down, improve the accuracy for identifying the answer, and shorten the processing time required to identify the answer. At the same time, the short medical term "drug history" is included when looking at the scan results of the questionnaire and the questionnaire, so that the medical staff can instantly understand where and what kind of information is available. In particular, there are many katakana in drug names, so in character recognition processing, so (so) and n (n), shi (shi) and tsu (tsu), tsu (uppercase) and tsu (lowercase), pa (pa) Misrecognition such as ba is likely to occur. Conventionally, it is difficult to improve the accuracy of these handwritten inputs, and mistaken drug names can immediately affect the life and death of patients. Therefore, the accuracy of character recognition processing can be sufficiently improved, and medical staff can intuitively improve it. It was important to be able to understand. Since the medical care support system S according to the present embodiment describes the perception of medical words, it is possible for the medical staff to intuitively perform the check, and at the same time, it is possible to improve the accuracy of identifying the answer.

さらに問診票３１は、回答領域３１１の位置決めに用いられる複数の記号３１５と、問診票３１を特定するためのコード３１６とを含む。複数の記号３１５は、回答領域３１１の外側の所定位置に配置されている。複数の記号３１５は、円形、矩形、多角形等の所定形状の記号であり、同一の形状であってもよく、異なる形状であってもよい。また、複数の記号３１５は、線で囲まれた領域などの４隅またはその中間地点であってもよい。コード３１６は、例えば問診票３１の識別情報（問診票３１のＩＤ等）を符号化したバーコード又は二次元コードである。診療支援装置１は、後述のように問診票３１の画像に含まれる記号３１５及びコード３１６に基づいて、問診票３１に含まれる回答領域３１１及び回答欄３１３の位置を特定することができる。 Further, the questionnaire 31 includes a plurality of symbols 315 used for positioning the answer area 311 and a code 316 for identifying the questionnaire 31. The plurality of symbols 315 are arranged at predetermined positions outside the response area 311. The plurality of symbols 315 are symbols having a predetermined shape such as a circle, a rectangle, and a polygon, and may have the same shape or different shapes. Further, the plurality of symbols 315 may be four corners such as an area surrounded by a line or intermediate points thereof. The code 316 is, for example, a barcode or a two-dimensional code in which the identification information of the questionnaire 31 (ID of the questionnaire 31 or the like) is encoded. The medical care support device 1 can specify the positions of the answer area 311 and the answer column 313 included in the questionnaire 31 based on the symbol 315 and the code 316 included in the image of the questionnaire 31 as described later.

図５に示した問診票３１は一例であり、その他のレイアウトであってもよく、その他の情報を含んでもよい。 The questionnaire 31 shown in FIG. 5 is an example, and may have other layouts or may include other information.

患者は、問診票３１において、質問３１２に対する回答を回答欄３１３へ記入する。次に医療従事者又は患者は、問診票３１のうち固定具Ｆを含む一部を切断し、問診票３１を１枚ずつの紙に分ける。そして医療従事者又は患者は、読取装置３に問診票３１を読み取らせる。読取装置３は、患者が回答した問診票３１を読み取り、問診票３１の画像を診療支援装置１へ送信する。 The patient fills in the answer column 313 with the answer to the question 312 on the questionnaire 31. Next, the medical staff or the patient cuts a part of the questionnaire 31 including the fixture F, and divides the questionnaire 31 into sheets of paper. Then, the medical staff or the patient has the reading device 3 read the questionnaire 31. The reading device 3 reads the questionnaire 31 answered by the patient and transmits the image of the questionnaire 31 to the medical care support device 1.

患者用端末を用いて患者の回答を受け付ける場合には、患者用端末は、表示部上に問診票３１の内容を表示する。患者は、電子的な問診票を表示している患者用端末に対して指又はタッチペンを用いて手書き入力をすることによって回答を記入する。患者用端末は、患者が電子的な問診票３１上で行った手書き入力の情報（例えば手書き入力の軌跡）を、問診票３１の画像として診療支援装置１へ送信する。問診票３１の画像として、患者が行った入力を示すベクトルデータを用いてもよい。 When the patient's answer is accepted using the patient's terminal, the patient's terminal displays the contents of the questionnaire 31 on the display unit. The patient fills in the answer by handwriting with a finger or a stylus on the patient terminal displaying the electronic questionnaire. The patient terminal transmits the handwritten input information (for example, the trajectory of the handwritten input) performed by the patient on the electronic questionnaire 31 to the medical care support device 1 as an image of the questionnaire 31. As the image of the questionnaire 31, vector data indicating the input made by the patient may be used.

診療支援装置１において、まず記入内容取得部１１１は、受信した問診票３１の画像の中で、記号３１５及びコード３１６を検出する。ここで、コード３１６に関連付けられた情報にどちらのページが先に読み取られると医師が読みやすいかの順序情報を予め関連付けておき、記入内容取得部１１１は、該順序情報に基づいて問診票３１が含む複数のページを並び替えてもよい。これにより、問診票３１が含む複数のページが誤った順番でスキャンされた場合でも、該複数のページが正しい順番に沿った結果に並び替えることが可能である。そして記入内容取得部１１１は、検出したコード３１６が示す問診票３１の識別情報を取得するとともに、検出した記号３１５に基づいて回答領域３１１の位置を特定する。 In the medical care support device 1, first, the entry content acquisition unit 111 detects the symbol 315 and the code 316 in the image of the received questionnaire 31. Here, the information associated with the code 316 is associated with the order information of which page is read first, which is easier for the doctor to read, and the entry content acquisition unit 111 uses the questionnaire 31 based on the order information. You may rearrange a plurality of pages included in. As a result, even if a plurality of pages included in the questionnaire 31 are scanned in an erroneous order, the plurality of pages can be rearranged in the correct order. Then, the entry content acquisition unit 111 acquires the identification information of the questionnaire 31 indicated by the detected code 316, and identifies the position of the answer area 311 based on the detected symbol 315.

図６（ａ）、図６（ｂ）は、記入内容取得部１１１が回答領域３１１の位置を特定する方法の模式図である。図６（ａ）は一部が切り取られた問診票３１を表しており、図６（ｂ）は切り取られていない問診票３１を表している。図６（ａ）の状態では、記入内容取得部１１１は、問診票３１において、左上の記号３１５を検出せず、中央上の記号３１５及び右上の記号３１５を検出する。 6 (a) and 6 (b) are schematic views of a method in which the entry content acquisition unit 111 specifies the position of the response area 311. FIG. 6A shows a partially cut-out questionnaire 31, and FIG. 6B shows an uncut questionnaire 31. In the state of FIG. 6A, the entry content acquisition unit 111 does not detect the symbol 315 on the upper left, but detects the symbol 315 on the center and the symbol 315 on the upper right on the questionnaire 31.

記入内容取得部１１１は、右上の記号３１５から中央上の記号３１５へのベクトルＶ１（線分）を特定する。そして記入内容取得部１１１は、ベクトルＶ１の長さに、所定の係数を乗算することによって、回答領域３１１を区画する上辺Ｅの長さを算出することにより、上辺Ｅの位置を特定する。具体的には、記入内容取得部１１１は、問診票３１の水平方向における記号３１５と同じ位置を起点として、ベクトルＶ１の長さに係数を乗算した値を幅とする領域を、回答領域３１１の水平方向の位置であると特定する。係数は、診療支援装置１のレイアウト記憶部１２１に予め設定される。係数は、問診票３１の識別情報ごとに設定されてもよく、全ての問診票３１で共通して設定されてもよい。 The entry content acquisition unit 111 identifies the vector V1 (line segment) from the symbol 315 on the upper right to the symbol 315 on the center. Then, the entry content acquisition unit 111 specifies the position of the upper side E by calculating the length of the upper side E that divides the response area 311 by multiplying the length of the vector V1 by a predetermined coefficient. Specifically, the entry content acquisition unit 111 sets an area having the width obtained by multiplying the length of the vector V1 by a coefficient from the same position as the symbol 315 in the horizontal direction of the questionnaire 31 as the width of the answer area 311. Identify the position in the horizontal direction. The coefficient is preset in the layout storage unit 121 of the medical care support device 1. The coefficient may be set for each identification information of the questionnaire 31, or may be set in common for all the questionnaires 31.

例えば中央上の記号３１５が回答領域３１１の上辺Ｅの中点の上側に配置されており、右上の記号３１５が回答領域３１１の上辺Ｅの右端の上側に配置されている場合には、係数は２である。中央上の記号３１５が回答領域３１１の上辺Ｅを３等分する２点のうち右の点の上側に配置されており、右上の記号３１５が回答領域３１１の上辺Ｅの右端の上側に配置されている場合には、係数は３である。 For example, if the symbol 315 on the center is located above the midpoint of the upper side E of the answer area 311 and the symbol 315 on the upper right is located above the right edge of the upper side E of the answer area 311, the coefficient is It is 2. The symbol 315 on the center is placed above the right point of the two points that divide the upper side E of the answer area 311 into three equal parts, and the symbol 315 on the upper right is placed above the right end of the upper side E of the answer area 311. If so, the coefficient is 3.

記入内容取得部１１１は、右上の記号３１５から中央上の記号３１５へのベクトルＶ１の長さに、問診票３１に対して設定されたこれらの係数を乗算した後に、回答領域３１１と記号３１５との間の距離に対応する所定値移動させることによって、回答領域３１１の上辺Ｅの位置を特定できる。 The entry content acquisition unit 111 multiplies the length of the vector V1 from the symbol 315 on the upper right to the symbol 315 on the center by these coefficients set for the questionnaire 31, and then adds the answer area 311 and the symbol 315. The position of the upper side E of the answer area 311 can be specified by moving a predetermined value corresponding to the distance between the two.

図６（ｂ）の状態では、記入内容取得部１１１は、問診票３１において、左上の記号３１５、中央上の記号３１５及び右上の記号３１５を検出する。記入内容取得部１１１は、右上の記号３１５から左上の記号３１５へのベクトルＶ２（線分）を特定する。そして記入内容取得部１１１は、ベクトルＶ２を、回答領域３１１と記号３１５との間の距離に対応する所定値移動させることによって、回答領域３１１の上辺Ｅの位置を特定できる。 In the state of FIG. 6B, the entry content acquisition unit 111 detects the symbol 315 on the upper left, the symbol 315 on the center, and the symbol 315 on the upper right on the questionnaire 31. The entry content acquisition unit 111 specifies the vector V2 (line segment) from the symbol 315 on the upper right to the symbol 315 on the upper left. Then, the entry content acquisition unit 111 can specify the position of the upper side E of the answer area 311 by moving the vector V2 by a predetermined value corresponding to the distance between the answer area 311 and the symbol 315.

問診票３１において、左上の記号３１５（すなわち切り取られる可能性のある記号３１５）は、最初から省略されてもよい。この場合には、記入内容取得部１１１は、問診票３１の一部が切り取られているか否かに関わらず、中央上の記号３１５及び右上の記号３１５の間のベクトルＶ１の長さに所定の係数を乗算することによって回答領域３１１の上辺Ｅの位置を特定する。 In the questionnaire 31, the symbol 315 on the upper left (that is, the symbol 315 that may be cut off) may be omitted from the beginning. In this case, the entry content acquisition unit 111 determines the length of the vector V1 between the symbol 315 on the center and the symbol 315 on the upper right regardless of whether or not a part of the questionnaire 31 is cut off. The position of the upper side E of the answer area 311 is specified by multiplying the coefficients.

ここでは記入内容取得部１１１が回答領域３１１を区画する上辺の位置を特定する方法を説明したが、回答領域３１１を区画する左辺、右辺及び下辺の位置についても同様の方法で特定できる。 Here, the method of specifying the position of the upper side that divides the answer area 311 by the entry content acquisition unit 111 has been described, but the positions of the left side, the right side, and the lower side that divide the answer area 311 can also be specified by the same method.

記入内容取得部１１１は、特定した回答領域３１１の位置に基づいて、受信した問診票３１の画像を補正する。これにより、記入内容取得部１１１は、読取装置３に問診票３１を読み取らせる際に発生した画像の歪みや傾きを補正することができ、回答欄３１３に記入された文字を高精度に取得できる。 The entry content acquisition unit 111 corrects the received image of the questionnaire 31 based on the position of the specified answer area 311. As a result, the entry content acquisition unit 111 can correct the distortion and inclination of the image generated when the reading device 3 reads the questionnaire 31, and can acquire the characters entered in the answer column 313 with high accuracy. ..

このように、記入内容取得部１１１は、固定具Ｆを切り取ったこと等により複数の記号３１５のうち一部が欠損した場合又は切り取られる可能性のある位置（例えば問診票３１の左上）に記号３１５が配置されていない場合であっても、回答領域３１１の位置を特定し、画像を補正できる。記入内容取得部１１１は、ここに示した具体的な方法に限定されず、その他の方法によって記号３１５の位置に基づいて回答領域３１１の位置を特定してもよい。 In this way, the entry content acquisition unit 111 has a symbol at a position (for example, the upper left of the questionnaire 31) where a part of the plurality of symbols 315 is missing or may be cut due to cutting the fixture F or the like. Even when the 315 is not arranged, the position of the response area 311 can be specified and the image can be corrected. The entry content acquisition unit 111 is not limited to the specific method shown here, and may specify the position of the response area 311 based on the position of the symbol 315 by another method.

次に記入内容取得部１１１は、問診票３１の画像に基づいて、回答欄３１３に記入された内容を取得する。記入内容取得部１１１は、チェックボックス等の選択欄である回答欄３１３で選択された項目を記入内容として取得するとともに、自由記入欄である回答欄３１３に記入された一群の文字を記入内容として取得する。自由記入欄の読み取り内容には誤りが含まれやすいため、以下では自由記入欄である回答欄３１３に記入された一群の文字に基づいて患者の回答を特定する方法を説明する。 Next, the entry content acquisition unit 111 acquires the content entered in the answer column 313 based on the image of the questionnaire 31. The entry content acquisition unit 111 acquires the items selected in the answer column 313, which is a selection column such as a check box, as the entry content, and also uses a group of characters entered in the answer column 313, which is a free entry column, as the entry content. get. Since the reading content of the free entry field is likely to contain errors, the method of identifying the patient's answer based on the group of characters entered in the answer field 313, which is the free entry field, will be described below.

記入内容取得部１１１は、画像から検出したコード３１６が示す問診票３１の識別情報に関連付けられたレイアウト情報を、レイアウト記憶部１２１から取得する。レイアウト情報は、問診票３１上の所定位置を基準とした、問診票３１に含まれる各回答欄３１３の相対的な位置を示す。基準とする所定位置は、例えば右上の記号３１５の位置であってもよく、コード３１６の位置であってもよく、回答領域３１１のいずれかの頂点の位置であってもよい。回答欄３１３の位置は、例えば問診票３１の画像中の回答欄３１３の座標範囲によって表される。また、レイアウト情報は、問診票３１上の所定位置（例えば左端や上端）を基準とした複数の回答欄３１３の順番によって、各回答欄３１３の相対的な位置を示してもよい。 The entry content acquisition unit 111 acquires the layout information associated with the identification information of the questionnaire 31 indicated by the code 316 detected from the image from the layout storage unit 121. The layout information indicates the relative position of each answer column 313 included in the questionnaire 31 with reference to a predetermined position on the questionnaire 31. The predetermined position as a reference may be, for example, the position of the symbol 315 on the upper right, the position of the code 316, or the position of any apex of the answer area 311. The position of the answer column 313 is represented by, for example, the coordinate range of the answer column 313 in the image of the questionnaire 31. Further, the layout information may indicate the relative position of each answer column 313 in the order of the plurality of answer columns 313 with reference to a predetermined position (for example, the left end or the upper end) on the questionnaire 31.

記入内容取得部１１１は、問診票３１の画像の中でレイアウト情報が示す各回答欄３１３の位置を含む画像を取り出す。記入内容取得部１１１は、各回答欄３１３について取り出した画像に対してＯＣＲ等の既知の文字認識処理を適用することによって、回答欄３１３に記入された一群の文字（一以上の文字）を取得する。記入内容取得部１１１は、取得した一群の文字を、取得元の回答欄３１３に関連付けておく。 The entry content acquisition unit 111 takes out an image including the position of each answer column 313 indicated by the layout information in the image of the questionnaire 31. The entry content acquisition unit 111 acquires a group of characters (one or more characters) entered in the answer column 313 by applying a known character recognition process such as OCR to the image extracted for each answer column 313. To do. The entry content acquisition unit 111 associates the acquired group of characters with the answer column 313 of the acquisition source.

［候補単語を絞り込む処理の説明］
診療支援システムＳが実行する診療支援方法のうち、候補単語を絞り込む処理を以下に説明する。候補単語記憶部１２２は、問診票３１に記入される可能性のある複数の候補単語を予め記憶している。候補単語記憶部１２２は、回答欄３１３に記載されることが想定される候補単語だけでなく、過去に回答欄３１３に記載された単語を学習することによって取得された候補単語を記憶してもよい。学習は、候補単語が出現するか否かだけでなく、候補単語の出現頻度等について行われる。 [Explanation of processing to narrow down candidate words]
Among the medical care support methods executed by the medical care support system S, the process of narrowing down the candidate words will be described below. The candidate word storage unit 122 stores in advance a plurality of candidate words that may be entered in the questionnaire 31. The candidate word storage unit 122 may memorize not only the candidate words that are expected to be described in the answer column 313 but also the candidate words acquired by learning the words that are described in the answer column 313 in the past. Good. Learning is performed not only on whether or not a candidate word appears, but also on the frequency of appearance of the candidate word and the like.

また、候補単語記憶部１２２は、複数の候補単語それぞれを、１つの単語そのものの状態と、１つの単語を複数の部分に分割した状態と、複数の単語を結合した状態とのうち少なくとも１つの状態で記憶してもよい。具体的には、候補単語記憶部１２２は、候補単語をｍｏｎｏｇｒａｍ（１文字）、ｂｉｇｒａｍ（２文字）、ｔｒｉｇｒａｍ（３文字）等の連続した１つ以上の文字ごとの部分に分割して記憶してもよい。この場合には、候補単語記憶部１２２は、候補単語を分割した複数の部分を空白文字やカンマ（読点）によって結合して記憶する。また、候補単語記憶部１２２は、異なる複数の単語を、１つの単語として結合して記憶してもよい。この場合には、候補単語記憶部１２２は、複数の単語を空白文字やカンマ（読点）によって結合して記憶する。 Further, the candidate word storage unit 122 has at least one of a state of one word itself, a state of dividing one word into a plurality of parts, and a state of combining a plurality of words for each of the plurality of candidate words. It may be memorized in the state. Specifically, the candidate word storage unit 122 divides and stores the candidate word into one or more consecutive character-by-character parts such as monogram (1 character), bigram (2 characters), and trigram (3 characters). You may. In this case, the candidate word storage unit 122 stores a plurality of divided parts of the candidate word by combining them with a blank character or a comma (a reading point). Further, the candidate word storage unit 122 may combine and memorize a plurality of different words as one word. In this case, the candidate word storage unit 122 stores a plurality of words by combining them with a blank character or a comma (a reading point).

絞込部１１２は、候補単語記憶部１２２に記憶された複数の候補単語に対して、絞込処理を行う。図７は、絞込部１１２による絞込処理の模式図である。まず絞込部１１２は、候補単語記憶部１２２から候補単語を取得する。また、絞込部１１２は、記入内容取得部１１１が文字認識処理によって取得した一群の文字それぞれについて、該一群の文字が取得された位置（すなわち該一群の文字がいずれの回答欄３１３に記入されていたか）を取得する。 The narrowing-down unit 112 performs a narrowing-down process on a plurality of candidate words stored in the candidate word storage unit 122. FIG. 7 is a schematic view of the narrowing process by the narrowing section 112. First, the narrowing unit 112 acquires a candidate word from the candidate word storage unit 122. Further, in the narrowing-down unit 112, for each of the group of characters acquired by the entry content acquisition unit 111 by the character recognition process, the position where the group of characters is acquired (that is, the group of characters is entered in any of the answer columns 313). Was it) to get.

第１の方法として、絞込部１１２は、一群の文字が記入された回答欄３１３に関連付けられた医学用語の見出し３１４（すなわち、回答欄３１３の中又は近傍に記載された医学用語）に基づいて、候補単語を絞り込む。この場合に、候補単語記憶部１２２に記憶された複数の候補単語は、それぞれ見出し３１４として利用され得る一以上の医学用語に予め関連付けられている。 As a first method, the narrowing section 112 is based on the medical term heading 314 (ie, the medical term described in or near the answer column 313) associated with the answer column 313 in which a group of letters is entered. To narrow down the candidate words. In this case, the plurality of candidate words stored in the candidate word storage unit 122 are previously associated with one or more medical terms that can be used as headings 314, respectively.

記入内容取得部１１１は、問診票３１の画像に基づいて、回答欄３１３の一群の文字に加えて、回答欄３１３に関連付けられた見出し３１４の内容を取得する。図７の例では、「ロキソプロフェン」と記入されている回答欄３１３に、「アレルギー」の見出し３１４が関連付けられている。ここでは、１つの枠線の中に回答欄３１３と見出し３１４とが配置されていることによって、それらが関連付けられている。１つの見出し３１４に複数の回答欄３１３が関連付けられてもよい。ここに示した方法に限られず、データベースによる関連付け等、その他の方法によって回答欄３１３と見出し３１４とが関連付けられてもよい。 Based on the image of the questionnaire 31, the entry content acquisition unit 111 acquires the content of the heading 314 associated with the answer column 313 in addition to the group of characters of the answer column 313. In the example of FIG. 7, the heading 314 of "allergy" is associated with the answer column 313 in which "loxoprofen" is written. Here, the answer column 313 and the heading 314 are arranged in one frame, so that they are related to each other. A plurality of answer columns 313 may be associated with one heading 314. The method is not limited to the method shown here, and the answer column 313 and the heading 314 may be associated with each other by other methods such as association by a database.

「アレルギー」の見出し３１４に関連付けられた自由記入欄である回答欄３１３には、薬剤名が記入される可能性がある。そのため、絞込部１１２は、候補単語記憶部１２２に記憶された候補単語を、「アレルギー」の見出し３１４に関連付けられた候補単語（具体的には薬剤名に関する候補単語）に絞り込む。 The drug name may be entered in the answer column 313, which is a free entry column associated with the heading 314 of "allergy". Therefore, the narrowing-down unit 112 narrows down the candidate words stored in the candidate word storage unit 122 to the candidate words associated with the heading 314 of "allergy" (specifically, the candidate words related to the drug name).

また、図５の例では、「発熱」の見出し３１４に関連付けられた自由記入欄である回答欄３１３には、時間が記入される可能性がある。そのため、絞込部１１２は、候補単語記憶部１２２に記憶された候補単語を、「発熱」の見出し３１４に関連付けられた候補単語（具体的には時間に関する候補単語）に絞り込む。 Further, in the example of FIG. 5, the time may be entered in the answer column 313, which is a free entry column associated with the heading 314 of "fever". Therefore, the narrowing unit 112 narrows down the candidate words stored in the candidate word storage unit 122 to the candidate words associated with the heading 314 of "fever" (specifically, the candidate words related to time).

診療支援装置１は、予め機械学習を行うことによって、医学用語と候補単語とを関連付けて候補単語記憶部１２２に記憶している。具体的には、診療支援装置１は、例示的な問診票３１の回答欄３１３から読み取られた一群の文字（すなわち学習データ）と、回答欄３１３に関連付けられた見出し３１４の医学用語ごとに設定された一群の文字が示す単語（すなわち教師データ）とを用いて予め機械学習を行い、回答欄３１３に関連付けられた見出し３１４の医学用語ごとに一群の文字と単語との関係を示すモデルを生成する。そして診療支援装置１は、生成したモデルを候補単語記憶部１２２に記憶させる。例えば機械学習は、Convolutional Neural Network、Bidirectional Long Short-term Neural Network、及びConnectionist Temporal Classificationを連続に組み合わせることによって行われる。Convolutional Neural Networkは文字の１文字１文字を学習し、Bidirectional Long Short-term Neural Networkは文字候補とそれぞれの文字との確率の組の列に変換し、Connectionist Temporal Classificationは文字列に変換する。Connectionist Temporal Classificationの探索アルゴリズムには、出てくるアウトプットが、候補単語の中に絞られるように制限が加えられている。診療支援装置１は、その他の既知の機械学習方法を用いてもよい。例えば、診療支援装置１は、機械学習で学習された分類器を用いて、記入された文字列から１つ又は複数の文字への分割を行い、その後、過剰に分割された１つ又は複数の文字を結合する。次に、診療支援装置１は、機械学習の結果学習された分類器を用いて、分割又は結合された１つ又は複数の文字それぞれに対して認識した結果として文字候補とその確率を取得し、それらを結合することで医学単語の候補群とする。そして診療支援装置１は、それらの候補群の文字列を医学用語のスペルチェック等のアルゴリズムにて変換することによって、候補単語を取得してもよい。 The medical care support device 1 stores medical terms and candidate words in association with each other in the candidate word storage unit 122 by performing machine learning in advance. Specifically, the medical care support device 1 is set for each group of characters (that is, learning data) read from the answer column 313 of the exemplary questionnaire 31 and the medical term of the heading 314 associated with the answer column 313. Machine learning is performed in advance using the words (that is, teacher data) indicated by the group of letters, and a model showing the relationship between the group of letters and words is generated for each medical term of the heading 314 associated with the answer column 313. To do. Then, the medical care support device 1 stores the generated model in the candidate word storage unit 122. For example, machine learning is performed by continuously combining Convolutional Neural Network, Bidirectional Long Short-term Neural Network, and Connectionist Temporal Classification. Convolutional Neural Network learns each character character by character, Bidirectional Long Short-term Neural Network converts it into a string of probability pairs of character candidates and each character, and Connectionist Temporal Classification converts it into a character string. The search algorithm of Connectionist Temporal Classification is restricted so that the output that comes out is narrowed down to the candidate words. The medical care support device 1 may use other known machine learning methods. For example, the medical care support device 1 divides the entered character string into one or more characters by using a classifier learned by machine learning, and then divides one or more characters into excessively divided characters. Combine characters. Next, the medical care support device 1 acquires a character candidate and its probability as a result of recognizing each of one or a plurality of divided or combined characters by using a classifier learned as a result of machine learning. By combining them, it becomes a candidate group of medical words. Then, the medical care support device 1 may acquire candidate words by converting the character strings of those candidate groups by an algorithm such as spell check of medical terms.

絞込部１１２は、候補単語記憶部１２２から、学習結果であるモデルを取得する。次に絞込部１１２は、記入内容取得部１１１が取得した一群の文字と、回答欄３１３に関連付けられた見出し３１４の医学用語とを、候補単語記憶部１２２から取得した学習結果であるモデルに入力する。そして絞込部１１２は、モデルからの出力を候補単語として絞り込む。 The narrowing-down unit 112 acquires a model as a learning result from the candidate word storage unit 122. Next, the narrowing-down unit 112 uses the group of characters acquired by the entry content acquisition unit 111 and the medical term of the heading 314 associated with the answer column 313 as a learning result acquired from the candidate word storage unit 122. input. Then, the narrowing unit 112 narrows down the output from the model as a candidate word.

このように候補単語記憶部１２２が回答欄３１３に関連付けられた医学用語の見出し３１４に基づいて候補単語を絞り込む場合に、各候補単語は、回答欄３１３にいちいち関連付けられている必要はなく、見出し３１４に用いられる限られた数の医学用語に関連付けられていればよい。そのため、例えば、医学用語の見出しが同じで質問を変えた場合にも、医療従事者が絞り込みのためのデータを労せずに流用できる。 When the candidate word storage unit 122 narrows down the candidate words based on the medical term heading 314 associated with the answer column 313 in this way, each candidate word does not have to be associated with the answer column 313 one by one, and the heading It may be associated with a limited number of medical terms used in 314. Therefore, for example, even if the headings of medical terms are the same and the question is changed, the medical staff can divert the data for narrowing down without any effort.

第２の方法として、絞込部１１２は、見出し３１４を用いず、一群の文字が記入された回答欄３１３自体に基づいて、候補単語を絞り込んでもよい。この場合に、候補単語記憶部１２２に記憶された複数の候補単語それぞれは、複数の問診票３１の複数の回答欄３１３それぞれに予め関連付けられている。絞込部１１２は、コード３１６が示す問診票３１の識別情報及び回答欄３１３の位置に基づいて回答欄３１３を特定し、候補単語記憶部１２２に記憶された候補単語を、特定した回答欄３１３に関連付けられた候補単語に絞り込む。この場合に、候補単語記憶部１２２は、見出し３１４の医学用語ではなく、回答欄３１３（回答欄３１３の位置）ごとに一群の文字と単語との関係を示すモデルを記憶する。 As a second method, the narrowing unit 112 may narrow down the candidate words based on the answer column 313 itself in which a group of characters is entered without using the heading 314. In this case, each of the plurality of candidate words stored in the candidate word storage unit 122 is associated with each of the plurality of answer columns 313 of the plurality of questionnaires 31 in advance. The narrowing-down unit 112 identifies the answer column 313 based on the identification information of the questionnaire 31 indicated by the code 316 and the position of the answer column 313, and identifies the candidate word stored in the candidate word storage unit 122 in the specified answer column 313. Narrow down to candidate words associated with. In this case, the candidate word storage unit 122 stores a model showing the relationship between a group of characters and words for each answer column 313 (position of the answer column 313) instead of the medical term of the heading 314.

このように候補単語記憶部１２２が回答欄３１３自体に基づいて候補単語を絞り込む場合に、各候補単語は回答欄３１３に関連付けられている必要があるが、問診票３１から見出し３１４を省略でき、従来の問診票３１をそのまま利用できるという利点がある。 When the candidate word storage unit 122 narrows down the candidate words based on the answer column 313 itself in this way, each candidate word needs to be associated with the answer column 313, but the heading 314 can be omitted from the questionnaire 31. There is an advantage that the conventional questionnaire 31 can be used as it is.

以上のように絞込部１１２は、一群の文字が取得された位置及び機械学習の学習結果を用いて、一群の文字が示す可能性のある候補単語を絞り込むため、回答を特定するのに掛かる処理時間を短縮できるとともに、回答を特定する精度を向上できる。 As described above, the narrowing unit 112 takes to identify the answer in order to narrow down the candidate words that the group of characters may indicate by using the position where the group of characters was acquired and the learning result of machine learning. The processing time can be shortened and the accuracy of identifying the answer can be improved.

［回答を特定する処理の説明］
診療支援システムＳが実行する診療支援方法のうち、回答を特定する処理を以下に説明する。類似情報記憶部１２３は、文字の各組み合わせの文字間距離を示すマトリクス（行列）を予め記憶している。 [Explanation of the process to identify the answer]
Among the medical care support methods executed by the medical care support system S, the process of specifying the answer will be described below. The similar information storage unit 123 stores in advance a matrix (matrix) indicating the inter-character distance of each combination of characters.

図８は、文字間距離を示すマトリクスＭの模式図である。文字間距離は、２つの文字の間の外観上の類似性を示す値であり、例えば２つの文字が類似しているほど値が小さくなるように定義される。 FIG. 8 is a schematic diagram of a matrix M showing the distance between characters. The inter-character distance is a value indicating the appearance similarity between two characters, and is defined so that the value becomes smaller as the two characters are similar.

図８の例では、例えば「ソ」と「ン」は外観上類似しているためそれらの間の文字間距離は小さいのに対して、「あ」と「い」は外観上類似していないためそれらの間の文字間距離は大きい。また、同一の文字間の文字間距離は０である。 In the example of FIG. 8, for example, "so" and "n" are similar in appearance, so the distance between letters is small, whereas "a" and "i" are not similar in appearance. Therefore, the distance between characters is large. The distance between characters of the same character is 0.

文字間距離は、ここに示した定義に限られず、２つの文字が類似しているほど値が大きくなるように定義されてもよく、あるいは２つの文字が類似しているか否かの２値を用いて定義されてもよい。 The inter-character distance is not limited to the definition shown here, and may be defined so that the value becomes larger as the two characters are similar, or the binary value of whether or not the two characters are similar is used. It may be defined using.

図８においてマトリクスＭは視認性のために文字の表で表されているが、マトリクスＭは文字列データ、数値データ、バイナリデータ等のその他形式であってもよい。マトリクスＭは、データベースとして記録されてもよく、あるいはデータを列挙したリストとして記録されてもよい。 In FIG. 8, the matrix M is represented by a table of characters for visibility, but the matrix M may be in other formats such as character string data, numerical data, and binary data. The matrix M may be recorded as a database or as a list listing data.

回答特定部１１３は、類似情報記憶部１２３から文字間距離のマトリクスＭを取得する。回答特定部１１３は、取得したマトリクスＭを用いて、記入内容取得部１１１が取得した一群の文字に含まれる各文字と、絞込部１１２が絞り込んだ候補単語それぞれに含まれる各文字との間の文字間距離を合計し、類似度を算出する。図８の例では、類似度は、一群の文字と候補単語とが類似しているほど小さい値となり、一群の文字と候補単語とが類似しているほど大きい値となる。 The answer specifying unit 113 acquires the matrix M of the inter-character distance from the similar information storage unit 123. The answer specifying unit 113 uses the acquired matrix M to be between each character included in the group of characters acquired by the entry content acquisition unit 111 and each character included in each of the candidate words narrowed down by the narrowing unit 112. Calculate the similarity by summing the distances between characters. In the example of FIG. 8, the similarity becomes smaller as the group of characters and the candidate word are similar, and becomes larger as the group of characters and the candidate word are similar.

具体的には、回答特定部１１３は、比較する一群の文字のＸ文字目（Ｘ：１〜一群の文字の文字数）と候補単語のＸ文字目との間の文字間距離をマトリクスＭから取得し、取得した文字間距離を順に合計していく。回答特定部１１３は、比較する一群の文字と候補単語の文字数が異なる場合には、文字をずらして文字間距離（例えば一群の文字のＸ文字目と候補単語のＸ−１文字目又はＸ＋１文字目との間の文字間距離）を合計してもよい。 Specifically, the answer specifying unit 113 acquires the inter-character distance between the Xth character of the group of characters to be compared (X: the number of characters of the 1st group of characters) and the Xth character of the candidate word from the matrix M. Then, the acquired distances between characters are summed up in order. When the number of characters of the group of characters to be compared and the number of characters of the candidate word are different, the answer identification unit 113 shifts the characters and the distance between the characters (for example, the Xth character of the group of characters and the X-1th character or the X + 1 character of the candidate word) The distance between letters between the eyes) may be totaled.

例えば記入内容取得部１１１が取得した一群の文字が「ロキソプロフェソ」であり、候補単語が「ロキソプロフェン」である場合に、それらの間の差異が１つのみ（しかも文字間距離の小さい「ソ」と「ン」）であるため、類似度が相対的に小さい値となる。記入内容取得部１１１が取得した一群の文字が「ロキソプロフェソ」であり、候補単語が「プラノプロフェン」である場合に、それらの間の差異が４つあるため、類似度が相対的に大きい値となる。 For example, when the group of characters acquired by the entry content acquisition unit 111 is "loxoprofen" and the candidate word is "loxoprofen", there is only one difference between them (and "so" with a small inter-character distance). Since it is "n"), the similarity is relatively small. When the group of characters acquired by the entry content acquisition unit 111 is "Loxoprofen" and the candidate word is "pranoprofen", there are four differences between them, so the similarity is relatively large. It becomes.

そして回答特定部１１３は、絞込部１１２が絞り込んだ候補単語のうち、算出した類似度が所定の条件を満たす候補単語を回答として特定する。図８の例では、類似度が一群の文字と候補単語とが類似しているほど小さい値となるため、回答特定部１１３は類似度が最も低い候補単語を回答として特定する。類似度が一群の文字と候補単語とが類似しているほど大きい値になる場合には、回答特定部１１３は類似度が最も高い候補単語を回答として特定してもよい。 Then, the answer specifying unit 113 specifies, among the candidate words narrowed down by the narrowing unit 112, the candidate words whose calculated similarity satisfies a predetermined condition as an answer. In the example of FIG. 8, the similarity becomes smaller as the characters in the group and the candidate word are similar to each other. Therefore, the answer specifying unit 113 specifies the candidate word having the lowest similarity as the answer. When the similarity becomes so large that the characters in the group and the candidate word are similar to each other, the answer specifying unit 113 may specify the candidate word having the highest similarity as the answer.

このように回答特定部１１３は、２つの文字の類似性を示す文字間距離の値に基づいて、候補単語の中から、画像から取得された一群の文字に対応する回答を特定する。そのため、回答特定部１１３は、画像から取得された一群の文字に文字認識処理の誤りや患者による誤りが含まれていたとしても、問診票３１に記入された回答を高精度に特定できる。また、候補単語が大量にある場合には一群の文字と全ての候補単語との間の類似度を算出するのに多大な時間が掛かるが、本実施形態では絞込部１１２が候補単語を絞り込んでいるため、回答特定部１１３は現実的な処理時間で回答を特定できる。 In this way, the answer specifying unit 113 identifies the answer corresponding to the group of characters acquired from the image from the candidate words based on the value of the inter-character distance indicating the similarity between the two characters. Therefore, the answer identification unit 113 can identify the answer entered in the questionnaire 31 with high accuracy even if the group of characters acquired from the image contains an error in the character recognition process or an error by the patient. Further, when there are a large number of candidate words, it takes a lot of time to calculate the similarity between a group of characters and all the candidate words, but in the present embodiment, the narrowing unit 112 narrows down the candidate words. Therefore, the answer identification unit 113 can specify the answer in a realistic processing time.

候補単語記憶部１２２が候補単語を２つ以上の文字ごとの部分に分割して記憶している場合に、回答特定部１１３は、前記複数の候補単語それぞれに含まれる１つの文字又は複数の文字の組み合わせと、当該部分の複数の文字の組み合わせを、記入内容取得部１１１が取得した一群の文字に含まれる複数の文字の組み合わせとの間の文字間距離の値を取得し、類似度を算出する。 When the candidate word storage unit 122 divides the candidate word into parts for each of two or more characters and stores the candidate word, the answer identification unit 113 includes one character or a plurality of characters included in each of the plurality of candidate words. The value of the inter-character distance between the combination of the above and the combination of the plurality of characters in the relevant part and the combination of the plurality of characters included in the group of characters acquired by the entry content acquisition unit 111 is acquired, and the similarity is calculated. To do.

回答特定部１１３は、記入内容取得部１１１が取得した一群の文字を探索するのにビームサーチを行い、ビームサーチの結果の中で、候補単語との文字間距離を探索し短いものから選択してもよい。なお、この文字間距離に関しても、正解データをもとに機械学習をした結果を用いたり、フォント一覧情報にたいしてａｕｔｏｅｎｃｏｄｉｎｇの学習によって得たパラメータを用いたりしてもよい。 The answer specifying unit 113 performs a beam search to search for a group of characters acquired by the entry content acquisition unit 111, searches for the distance between characters with the candidate word in the beam search results, and selects from the shortest one. You may. As for the distance between characters, the result of machine learning based on the correct answer data may be used, or the parameter obtained by learning autoencoding may be used for the font list information.

また、回答特定部１１３がある回答欄３１３（第１の回答欄）について特定した回答に基づいて、絞込部１１２は、別の回答欄３１３（第２の回答欄）について候補単語を絞り込んでもよい。この場合に、候補単語記憶部１２２は、第１の回答欄３１３の回答と、第２の回答欄の候補単語とを関連付けて予め記憶している。そして絞込部１１２は、第２の回答欄３１３について、第１の回答欄３１３の回答に関連付けられた候補単語に限定して、又は第１の回答欄３１３の回答に関連付けられた候補単語に重み付けをして、上述の絞り込みを行う。 Further, even if the narrowing section 112 narrows down the candidate words for another answer column 313 (second answer column) based on the answer specified for the answer column 313 (first answer column) in which the answer specifying section 113 is located. Good. In this case, the candidate word storage unit 122 stores the answer in the first answer column 313 in advance in association with the candidate word in the second answer column. Then, the narrowing unit 112 limits the second answer column 313 to the candidate words associated with the answer of the first answer column 313, or limits the candidate words associated with the answer of the first answer column 313. Weighting is performed to perform the above-mentioned narrowing down.

例えば回答特定部１１３が既往歴の回答欄３１３について「筋緊張頭痛」の回答を特定した場合に、同じ問診票３１の薬剤歴の回答欄３１３には「ロキソプロフェン」のような筋緊張頭痛に関連する薬剤名が記載されている可能性が高い。そこで絞込部１１２は、薬剤歴の回答欄３１３について、候補単語のうち、筋緊張頭痛に関連する薬剤名に限定して、又は重み付けをして、絞り込みを行う。 For example, when the answer identification unit 113 identifies the answer of "muscle tone headache" in the answer column 313 of the medical history, the answer column 313 of the drug history of the same questionnaire 31 is related to the muscle tone headache such as "loxoprofen". There is a high possibility that the name of the drug to be used is listed. Therefore, the narrowing-down unit 112 narrows down the answer column 313 of the drug history by limiting or weighting the drug names related to muscle tone headache among the candidate words.

出力部１１４は、回答特定部１１３が特定した回答を、患者（患者の識別情報）、回答欄３１３及び回答欄３１３に関連付けられた医学用語（見出し３１４）に関連付けて、回答記憶部１２４に記憶させることによって出力する。また、出力部１１４は、回答特定部１１３が特定した回答を表示するための情報を、診療用端末２に送信することによって出力する。出力部１１４は、その他方法によって情報を出力してもよい。例えば出力部１１４は、プリンタを用いて情報を印刷してもよく、通信手段を用いて外部に情報を送信してもよい。 The output unit 114 stores the answer specified by the answer identification unit 113 in the answer storage unit 124 in association with the patient (patient identification information), the medical term associated with the answer column 313 and the answer column 313 (heading 314). Output by letting. Further, the output unit 114 outputs the information for displaying the answer specified by the answer specifying unit 113 by transmitting the information to the medical care terminal 2. The output unit 114 may output information by other methods. For example, the output unit 114 may print the information using a printer, or may transmit the information to the outside by using a communication means.

図９は、診療用端末２の表示部２１の正面図である。診療用端末２は、診療支援装置１から受信した情報に従って、回答特定部１１３が特定した回答を表示部２１上に表示する。表示部２１は、第１ボタン２１１と、第２ボタン２１２とを表示する。第１ボタン２１１及び第２ボタン２１２は、医療従事者によって押下可能な仮想的なボタンである。第１ボタン２１１が押下された場合に図９の上図の画面が表示され、第２ボタン２１２が押下された場合に図９の下図の画面が表示される。 FIG. 9 is a front view of the display unit 21 of the medical terminal 2. The medical care terminal 2 displays the answer specified by the answer specifying unit 113 on the display unit 21 according to the information received from the medical care support device 1. The display unit 21 displays the first button 211 and the second button 212. The first button 211 and the second button 212 are virtual buttons that can be pressed by a medical professional. When the first button 211 is pressed, the screen shown in the upper part of FIG. 9 is displayed, and when the second button 212 is pressed, the screen shown in the lower part of FIG. 9 is displayed.

表示部２１は、図９の上図の画面において、問診票３１の質問３１２に対応する医学用語２１３と、回答特定部１１３が特定した回答２１４とを関連付けて（すなわち、互いに近傍に）表示する。この場合に、出力部１１４は、問診票３１の質問３１２に対応する医学用語２１３を示すテンプレートを診療用端末２に送信する。このテンプレートは、問診票３１（問診票３１の識別情報）に関連付けられて予め記憶部１２に記憶されている。表示部２１は、受信したテンプレートに応じて、医学用語２１３及び回答２１４を表示する。表示部２１は、自由記入欄である回答欄３１３については回答特定部１１３が特定した回答を回答２１４として表示し、チェックボックスである回答欄３１３についてはチェックボックスのオン又はオフに応じて「はい」又は「いいえ」を回答２１４として表示する。 The display unit 21 displays the medical term 213 corresponding to the question 312 of the questionnaire 31 and the answer 214 specified by the answer specifying unit 113 in association with each other (that is, in the vicinity of each other) on the screen shown in the upper part of FIG. .. In this case, the output unit 114 transmits a template showing the medical term 213 corresponding to the question 312 of the questionnaire 31 to the medical terminal 2. This template is associated with the questionnaire 31 (identification information of the questionnaire 31) and is stored in the storage unit 12 in advance. The display unit 21 displays the medical term 213 and the answer 214 according to the received template. The display unit 21 displays the answer specified by the answer specifying unit 113 as the answer 214 for the answer column 313 which is a free entry field, and displays the answer column 313 which is a check box according to whether the check box is on or off. Or "No" is displayed as answer 214.

表示部２１は、図９の下図の画面において、問診票３１が表す元の質問２１５と、記入内容取得部１１１が取得した問診票３１の記入内容２１６とを関連付けて（すなわち、互いに近傍に）表示する。この場合に、医学用語に対応する平易な表現の質問を参照できるように、レイアウト記憶部１２１は、問診票３１の見出し３１４としての医学用語と、問診票３１の質問３１２とを関連付けて記憶している。出力部１１４は、問診票３１の質問３１２を示すテンプレートを診療用端末２に送信する。このテンプレートは、問診票３１（問診票３１の識別情報）に関連付けられて予め記憶部１２に記憶されている。表示部２１は、受信したテンプレートに応じて、元の質問２１５及び記入内容２１６を表示する。表示部２１は、自由記入欄である回答欄３１３については記入内容取得部１１１が取得した一群の文字を記入内容２１６として表示し、チェックボックスである回答欄３１３についてはオン又はオフのチェックボックスを記入内容２１６として表示する。 On the screen shown in the lower part of FIG. 9, the display unit 21 associates the original question 215 represented by the questionnaire 31 with the entry content 216 of the questionnaire 31 acquired by the entry content acquisition unit 111 (that is, in the vicinity of each other). indicate. In this case, the layout storage unit 121 stores the medical term as the heading 314 of the questionnaire 31 and the question 312 of the questionnaire 31 in association with each other so that the question of the plain expression corresponding to the medical term can be referred to. ing. The output unit 114 transmits a template showing the question 312 of the questionnaire 31 to the medical care terminal 2. This template is associated with the questionnaire 31 (identification information of the questionnaire 31) and is stored in the storage unit 12 in advance. The display unit 21 displays the original question 215 and the entry contents 216 according to the received template. The display unit 21 displays a group of characters acquired by the entry content acquisition unit 111 as the entry content 216 for the answer column 313, which is a free entry field, and displays an on or off check box for the answer column 313, which is a check box. It is displayed as the entry content 216.

このように、出力部１１４は、回答特定部１１３が特定した回答２１４と、問診票３１の元の記入内容２１６とを切り替えて表示部２１に表示させることができるため、医療従事者は、誤りが訂正された後の回答だけでなく、誤りが修正される前の記入内容を確認することができる。また、出力部１１４は、問診票３１の質問３１２に対応する医学用語を表示部２１に表示させることができるため、医療従事者が問診票３１の質問３１２の内容を医学的に理解するための時間を短縮できる。 In this way, the output unit 114 can switch between the answer 214 specified by the answer specifying unit 113 and the original entry content 216 of the questionnaire 31 and display it on the display unit 21, so that the medical staff makes an error. You can check not only the answer after the correction is made, but also the contents before the error is corrected. Further, since the output unit 114 can display the medical terms corresponding to the question 312 of the questionnaire 31 on the display unit 21, the medical staff can medically understand the contents of the question 312 of the questionnaire 31. You can save time.

さらに出力部１１４は、回答特定部１１３が薬剤歴の回答欄３１３（第１の回答欄）において所定の薬剤名（稀な薬剤、糖尿病治療薬等）を特定した場合に、回答特定部１１３が既往歴の回答欄３１３（第２の回答欄）に当該薬剤名に対応する疾患名（病名）を特定したか否かを判定する。既往歴の回答欄３１３に薬剤名に対応する疾患名が特定されていない場合に、出力部１１４は、その旨を示す情報を、診療用端末２に送信することによって出力する。これにより、診療支援システムＳは、問診票３１の複数の回答が整合していないこと等を、医療従事者にアラートとして通知できる。 Further, in the output unit 114, when the response identification unit 113 specifies a predetermined drug name (rare drug, antidiabetic drug, etc.) in the response column 313 (first answer column) of the drug history, the response identification unit 113 It is determined whether or not the disease name (disease name) corresponding to the drug name is specified in the answer column 313 (second answer column) of the medical history. When the disease name corresponding to the drug name is not specified in the answer column 313 of the medical history, the output unit 114 outputs information indicating that fact by transmitting it to the medical care terminal 2. As a result, the medical care support system S can notify the medical staff as an alert that the plurality of answers on the questionnaire 31 are inconsistent.

［診療支援方法のフローチャート］
図１０は、本実施形態に係る診療支援方法（情報処理方法）のフローチャートを示す図である。まず患者は、問診票３１に、質問への回答を記入する。医療従事者又は患者は、問診票３１のうち固定具Ｆを含む一部を切断し、問診票３１を１枚ずつの紙に分ける。そして医療従事者又は患者は、読取装置３に問診票３１を読み取らせる。診療支援装置１において、記入内容取得部１１１は、問診票３１を読み取った読取装置３から、問診票３１の画像を受信する。 [Flowchart of medical care support method]
FIG. 10 is a diagram showing a flowchart of a medical care support method (information processing method) according to the present embodiment. First, the patient fills in the answer to the question on the questionnaire 31. The medical staff or the patient cuts a part of the questionnaire 31 including the fixture F, and divides the questionnaire 31 into sheets of paper. Then, the medical staff or the patient has the reading device 3 read the questionnaire 31. In the medical care support device 1, the entry content acquisition unit 111 receives the image of the questionnaire 31 from the reading device 3 that has read the questionnaire 31.

記入内容取得部１１１は、受信した問診票３１の画像の中で記号３１５を検出し、検出した記号３１５に基づいて回答領域３１１の位置を特定する（Ｓ１１）。記入内容取得部１１１は、特定した回答領域３１１の位置に基づいて、受信した問診票３１の画像を補正する。 The entry content acquisition unit 111 detects the symbol 315 in the image of the received questionnaire 31, and identifies the position of the answer area 311 based on the detected symbol 315 (S11). The entry content acquisition unit 111 corrects the received image of the questionnaire 31 based on the position of the specified answer area 311.

記入内容取得部１１１は、受信した問診票３１の画像の中でコード３１６を検出し、検出したコード３１６が示す問診票３１の識別情報に関連付けられたレイアウト情報を、レイアウト記憶部１２１から取得する。記入内容取得部１１１は、問診票３１の画像の中でレイアウト情報が示す各回答欄３１３の位置を含む画像を取り出す。記入内容取得部１１１は、各回答欄３１３について取り出した画像に対して文字認識処理を適用することによって、回答欄３１３に記入された一群の文字を取得する（Ｓ１２）。 The entry content acquisition unit 111 detects the code 316 in the received image of the questionnaire 31, and acquires the layout information associated with the identification information of the questionnaire 31 indicated by the detected code 316 from the layout storage unit 121. .. The entry content acquisition unit 111 takes out an image including the position of each answer column 313 indicated by the layout information in the image of the questionnaire 31. The entry content acquisition unit 111 acquires a group of characters entered in the answer column 313 by applying the character recognition process to the images taken out from each answer column 313 (S12).

絞込部１１２は、候補単語記憶部１２２から候補単語を取得する（Ｓ１３）。絞込部１１２は、記入内容取得部１１１が文字認識処理によって取得した一群の文字が取得された位置（すなわち該一群の文字がいずれの回答欄３１３に記入されていたか）及び候補単語記憶部１２２に予め記憶された学習結果に基づいて、候補単語を絞り込む（Ｓ１４）。 The narrowing-down unit 112 acquires a candidate word from the candidate word storage unit 122 (S13). The narrowing-down unit 112 includes a position where a group of characters acquired by the entry content acquisition unit 111 by character recognition processing is acquired (that is, which answer column 313 the group of characters was entered in) and a candidate word storage unit 122. The candidate words are narrowed down based on the learning result stored in advance in (S14).

回答特定部１１３は、類似情報記憶部１２３から文字間距離のマトリクスを取得する。回答特定部１１３は、取得したマトリクスを用いて、記入内容取得部１１１が取得した一群の文字に含まれる各文字と、絞込部１１２が絞り込んだ候補単語それぞれに含まれる各文字との間の文字間距離を合計し、類似度を算出する（Ｓ１５）。 The answer specifying unit 113 acquires a matrix of inter-character distances from the similar information storage unit 123. Using the acquired matrix, the answer specifying unit 113 between each character included in the group of characters acquired by the entry content acquisition unit 111 and each character included in each of the candidate words narrowed down by the narrowing unit 112. The distance between characters is totaled to calculate the similarity (S15).

回答特定部１１３は、絞込部１１２が絞り込んだ候補単語のうち、算出した類似度が所定の条件を満たす候補単語を回答として特定する（Ｓ１６）。出力部１１４は、回答特定部１１３が特定した回答を、回答記憶部１２４に記憶させ、また診療用端末２の表示部２１に表示させることによって出力する（Ｓ１７）。 The answer specifying unit 113 specifies as an answer a candidate word whose calculated similarity satisfies a predetermined condition among the candidate words narrowed down by the narrowing unit 112 (S16). The output unit 114 outputs the answer specified by the answer specifying unit 113 by storing it in the answer storage unit 124 and displaying it on the display unit 21 of the medical care terminal 2 (S17).

［本実施形態の効果］
本実施形態に係る診療支援システムＳは、問診票に記入される可能性のある候補単語を、文字が記入された位置及び学習結果に基づいて絞り込んだ上で、その中から回答を特定する。そのため、診療支援システムＳは、回答を特定するのに掛かる処理時間を短縮できるとともに、回答を特定する精度を向上できる。また、診療支援システムＳは、２つの文字の類似性を示す文字間距離の値に基づいて、候補単語の中から、画像から取得された一群の文字に対応する回答を特定する。そのため、診療支援システムＳは、画像から取得された一群の文字に文字認識処理の誤りや患者による誤りが含まれていたとしても、問診票３１に記入された回答を高精度に特定できる。 [Effect of this embodiment]
The medical care support system S according to the present embodiment narrows down the candidate words that may be entered in the questionnaire based on the position where the characters are entered and the learning result, and then identifies the answer from the candidate words. Therefore, the medical care support system S can shorten the processing time required to identify the answer and improve the accuracy of identifying the answer. Further, the medical care support system S identifies an answer corresponding to a group of characters acquired from the image from the candidate words based on the value of the inter-character distance indicating the similarity between the two characters. Therefore, the medical care support system S can identify the answer entered in the questionnaire 31 with high accuracy even if the group of characters acquired from the image contains an error in the character recognition process or an error by the patient.

さらに診療支援システムＳは、２つの記号間の線分に所定の係数を乗算することによって、問診票の中の回答領域の位置を特定し、画像を補正する。そのため、診療支援システムＳは、ステープラ等の固定具を切り取ったこと等により問診票の一部が欠損した場合又は切り取られる可能性のある位置に記号が配置されていない場合であっても、読取装置に問診票を読み取らせる際に発生した画像の歪みや傾きを補正することができ、問診票に記入された文字を高精度に取得できる。 Further, the medical care support system S identifies the position of the answer area in the questionnaire by multiplying the line segment between the two symbols by a predetermined coefficient, and corrects the image. Therefore, the medical care support system S reads even if a part of the questionnaire is missing due to cutting a fixture such as a stapler or the symbol is not placed at a position where the questionnaire may be cut. It is possible to correct the distortion and inclination of the image generated when the device is made to read the questionnaire, and it is possible to acquire the characters written on the questionnaire with high accuracy.

以上、本発明を実施の形態を用いて説明したが、本発明の技術的範囲は上記実施の形態に記載の範囲には限定されず、その要旨の範囲内で種々の変形及び変更が可能である。例えば、装置の分散・統合の具体的な実施の形態は、以上の実施の形態に限られず、その全部又は一部について、任意の単位で機能的又は物理的に分散・統合して構成することができる。また、複数の実施の形態の任意の組み合わせによって生じる新たな実施の形態も、本発明の実施の形態に含まれる。組み合わせによって生じる新たな実施の形態の効果は、もとの実施の形態の効果を合わせ持つ。 Although the present invention has been described above using the embodiments, the technical scope of the present invention is not limited to the scope described in the above embodiments, and various modifications and changes can be made within the scope of the gist. is there. For example, the specific embodiment of the distribution / integration of the device is not limited to the above embodiment, and all or a part thereof may be functionally or physically distributed / integrated in an arbitrary unit. Can be done. Also included in the embodiments of the present invention are new embodiments resulting from any combination of the plurality of embodiments. The effect of the new embodiment produced by the combination has the effect of the original embodiment together.

診療支援装置１のプロセッサは、図１０に示す方法に含まれる各ステップ（工程）の主体となる。すなわち、診療支援装置１のプロセッサは、図１０に示す方法を実行するためのプログラムを記憶部から読み出し、該プログラムを実行して診療支援システムＳの各部を制御することによって、図１０に示す方法を実行する。図１０に示す方法に含まれるステップは一部省略されてもよく、ステップ間の順番が変更されてもよく、複数のステップが並行して行われてもよい。 The processor of the medical care support device 1 is the main body of each step included in the method shown in FIG. That is, the processor of the medical care support device 1 reads a program for executing the method shown in FIG. 10 from the storage unit, executes the program, and controls each part of the medical care support system S, whereby the method shown in FIG. 10 To execute. Some of the steps included in the method shown in FIG. 10 may be omitted, the order between the steps may be changed, and a plurality of steps may be performed in parallel.

Ｓ診療支援システム
１診療支援装置
１１制御部
１１１記入内容取得部
１１２絞込部
１１３回答特定部
１１４出力部
１２記憶部
１２１レイアウト記憶部
１２２候補単語記憶部
１２３類似情報記憶部
３１問診票
３１１回答領域
３１３回答欄
３１５記号 S Medical treatment support system 1 Medical treatment support device 11 Control unit 111 Entry content acquisition unit 112 Narrowing unit 113 Answer specific unit 114 Output unit 12 Storage unit 121 Layout storage unit 122 Candidate word storage unit 123 Similar information storage unit 31 Questionnaire 311 Answer area 313 Answer column 315 Symbol

Claims

A layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, and
Candidate word storage unit that stores multiple candidate words,
An acquisition unit that acquires the identification information from the image of the questionnaire and acquires one or more characters included in the image based on the layout associated with the acquired identification information.
A narrowing section that narrows down the plurality of candidate words based on the position where one or more characters are acquired, and
A specific unit that identifies the answer entered in the questionnaire by comparing the one or more characters acquired by the acquisition unit with the plurality of candidate words narrowed down by the narrowing unit.
An output unit that outputs the answer specified by the specific unit, and an output unit that outputs the answer.
Information processing device with.

The specific unit calculates the similarity between the one or more characters acquired by the acquisition unit and each of the plurality of candidate words narrowed down by the narrowing unit, and selects the specific unit based on the calculated similarity. The information processing apparatus according to claim 1, wherein one of the plurality of candidate words is specified as the answer.

It further has a similar information storage unit that stores a value indicating similarity between two characters.
The specific unit is a combination of one character or a plurality of characters included in the one or more characters acquired by the acquisition unit and a combination of one character or a plurality of characters included in each of the plurality of candidate words. The information processing apparatus according to claim 2, wherein the value between the two is acquired, and the similarity is calculated using the acquired value.

The layout storage unit stores the layout indicating the position of the answer column included in the questionnaire with reference to the predetermined position.
The information processing device according to any one of claims 1 to 3, wherein the acquisition unit acquires the one or more characters at the position of the answer column indicated by the layout in the image.

The candidate word storage unit stores the plurality of candidate words in association with the answer column, and stores the plurality of candidate words.
The narrowing unit narrows down the plurality of candidate words by acquiring the plurality of candidate words associated with the answer column in which one or more characters are acquired from the candidate word storage unit. The information processing apparatus according to any one of 4 to 4.

The candidate word storage unit stores the plurality of candidate words in association with medical terms, and stores the plurality of candidate words.
In addition to the one or more characters, the acquisition unit acquires the medical term associated with the answer column in which the one or more characters are acquired in the image.
The narrowing unit narrows down the plurality of candidate words by acquiring the plurality of candidate words associated with the medical term acquired by the acquisition unit from the candidate word storage unit.
The information processing device according to any one of claims 1 to 4, wherein the output unit outputs the answer in association with the medical term.

The layout storage unit stores the questions described in the questionnaire in association with the medical terms.
The information processing device according to claim 6, wherein the output unit outputs the question in addition to the answer and the medical terminology.

The narrowing-down unit is based on the answer specified by the specific unit for the first answer column at the position where one or more characters are acquired, and the second answer column is different from the first answer column. , The information processing apparatus according to any one of claims 1 to 7, which narrows down the plurality of candidate words.

The candidate word storage unit is in at least one state of each of the plurality of candidate words being one word, one word divided into a plurality of parts, and a plurality of words combined. The information processing apparatus according to any one of claims 1 to 8, which is stored.

The acquisition unit identifies a line segment connecting two predetermined symbols included in the image, and the answer is entered in the questionnaire by multiplying the length of the line segment by a predetermined coefficient. The information processing apparatus according to any one of claims 1 to 9, which calculates the length of a side that divides a possible region and corrects the image based on the length.

A computer having a layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, and a candidate word storage unit that stores a plurality of candidate words.
A step of acquiring the identification information from the image of the questionnaire and acquiring one or more characters included in the image based on the layout associated with the acquired identification information.
A step of narrowing down the plurality of candidate words based on the position where one or more characters are acquired, and
A step of identifying the answer entered in the questionnaire by comparing the one or more characters acquired by the acquisition step with the plurality of candidate words narrowed down by the narrowing step.
The step of outputting the answer specified by the specific step and the step of outputting the answer
Information processing method having.

A computer having a layout storage unit that stores the layout of the questionnaire in association with the identification information that identifies the questionnaire, and a candidate word storage unit that stores a plurality of candidate words.
A step of acquiring the identification information from the image of the questionnaire and acquiring one or more characters included in the image based on the layout associated with the acquired identification information.
A step of narrowing down the plurality of candidate words based on the position where one or more characters are acquired, and
A step of identifying the answer entered in the questionnaire by comparing the one or more characters acquired by the acquisition step with the plurality of candidate words narrowed down by the narrowing step.
The step of outputting the answer specified by the specific step and the step of outputting the answer
An information processing program that executes.