JP6699825B2

JP6699825B2 - Diagnostic device, diagnostic device control method, and diagnostic program

Info

Publication number: JP6699825B2
Application number: JP2016036988A
Authority: JP
Inventors: Eiji Aramaki (荒牧英治)
Original assignee: Nara Institute of Science and Technology NUC
Current assignee: Nara Institute of Science and Technology NUC
Priority date: 2016-02-29
Filing date: 2016-02-29
Publication date: 2020-05-27
Anticipated expiration: 2036-02-29
Also published as: JP2017156402A

Description

The present invention relates to a diagnostic device and related techniques for diagnosing a disease whose symptoms appear in language.

Dementia is known to correlate with language ability, because the number of words a patient can use tends to decrease as the symptoms progress. Because measuring language ability is not physically invasive, it has attracted attention as a method for early detection of dementia.

Hasegawa's Dementia Scale-Revised (HDS-R) and Me-CDT are also known as methods for early detection of dementia, and a study has been reported that uses these methods to extract the tendencies and characteristics of the speech of elderly people suspected of dementia and of healthy elderly people (Non-Patent Document 1).

Non-Patent Document 1: Akiko Shikata, Mai Miyabe, Yasuha Noda, Aya Kinoshita, Eiji Aramaki (四方朱子、宮部真衣、野田泰葉、木下彩栄、荒牧英治), "A Qualitative Study of Speech Text of People with Mild Dementia: Toward Rapid Non-Invasive Screening for Dementia," IPSJ SIG Technical Report, Vol. 2015-UBI-47 No. 4, Vol. 2015-ASD-2 No. 4, July 27, 2015.

Language ability is generally divided into syntactic ability and lexical ability, and lexical ability is known to be the useful indicator for dementia. Known methods for measuring lexical ability include measuring the amount of vocabulary a person can understand or use, and measuring the number of propositions, an approach used in cognitive psychology.

However, because the unit of a word differs from language to language, the method of measuring vocabulary size suffers from ambiguity in what counts as a word, and the algorithm must be changed for each language.

The method of measuring the number of propositions has the problem that it requires manual work by an expert. A method called CPIDR, which automatically analyzes the number of propositions by counting parts of speech, has been proposed, but it is designed for English and is difficult to apply to other languages as it is.

Non-Patent Document 1 only extracts the characteristics of the speech of dementia patients and healthy people using known evaluation methods, and does not disclose any new method for determining dementia.

An object of the present invention is to provide a technique that can determine, simply and accurately and independently of the language used, whether a person is suspected of having a disease whose symptoms appear in language.

A diagnostic device according to one aspect of the present invention is a diagnostic device for diagnosing a disease whose symptoms appear in language, comprising:
an acquisition unit that acquires text data representing the utterance content of a person or a document written by the person;
a compression unit that losslessly compresses the text data acquired by the acquisition unit, using a data compression method whose compression rate becomes higher as the same word is repeated more often;
a compression rate calculation unit that calculates the compression rate of the text data losslessly compressed by the compression unit; and
a determination unit that determines, based on the compression rate calculated by the compression rate calculation unit, whether the person is suspected of having the disease.

When the speech of patients suspected of a disease whose symptoms appear in language, such as dementia, is analyzed, the patients are known to repeat the same words and phrases, so their speech is highly redundant. A lossless compression algorithm compresses the original data by extracting redundant phrases and assigning short codes to them, so the more redundant phrases the original data contains, the higher the compression rate. The present inventor found that, when text data representing the speech of a person suspected of such a disease and of a person not suspected of it are each losslessly compressed and the compression rates are compared, the compression rate is higher for the person suspected of the disease.

In this aspect, whether a person is suspected of having a disease whose symptoms appear in language is determined from the compression rate obtained when text data representing the person's utterance content is losslessly compressed. The determination can therefore be made with a single common algorithm that does not depend on the language. As a result, a disease whose symptoms appear in language can be determined simply and accurately.

In the above aspect, the diagnostic device may further comprise:
a sound pickup unit that picks up the voice uttered by the person; and
a voice recognition unit that converts the picked-up voice into the text data by voice recognition and outputs the text data to the acquisition unit.

According to this aspect, the text data is obtained by voice recognition of the voice uttered by the person, so the text data can be acquired without placing an excessive burden on the person.

In the above aspect, the acquisition unit may acquire the text data by performing text recognition on a document in which the person's utterance content is written or on a document written by the person.

According to this aspect, the text data is obtained by text recognition of a document in which the person's utterance content is written or of a document written by the person, so the text data can be acquired with simple processing.

In the above aspect, the determination unit may determine that the person is suspected of having the disease when the compression rate is higher than a predetermined threshold.

According to this aspect, the person is determined to be suspected of having the disease if the compression rate is higher than the predetermined threshold, so a disease whose symptoms appear in language can be determined with a simple algorithm.

In the above aspect, the determination unit may determine that the degree of the disease is higher as the compression rate becomes higher.

According to this aspect, the device can present not only whether the disease is suspected but also how strongly it is suspected.

In the above aspect, the disease may include dementia, brain disease, and mental disorder.

According to this aspect, dementia, brain diseases, and mental disorders, which correlate strongly with language ability, can be determined accurately.

According to the present invention, whether a person is suspected of having a disease whose symptoms appear in language can be determined simply and accurately, independently of the language used.

FIG. 1 is a block diagram showing the overall configuration of the diagnostic device 1 according to Embodiment 1 of the present invention.
FIG. 2 is a diagram showing an example of compression processing by the compression unit.
FIG. 3 is a flowchart showing an example of processing of the diagnostic device according to Embodiment 1 of the present invention.
FIG. 4 is a table summarizing the language indexes used in the experiment of the present embodiment.
FIG. 5 is a table summarizing the experimental results of the present embodiment.
FIG. 6 is a block diagram showing the overall configuration of the diagnostic device according to Embodiment 2 of the present invention.
FIG. 7 is a flowchart showing an example of processing of the diagnostic device according to Embodiment 2 of the present invention.
FIG. 8 is a diagram showing the overall configuration of the diagnostic device according to Embodiment 3 of the present invention.
FIG. 9 is a flowchart showing an example of processing of the diagnostic device according to Embodiment 3 of the present invention.
FIG. 10 is a block diagram showing the overall configuration of the diagnostic device according to Embodiment 4 of the present invention.
FIG. 11 is a flowchart showing an example of processing of the diagnostic device according to Embodiment 4 of the present invention.

(Embodiment 1)
FIG. 1 is a block diagram showing the overall configuration of the diagnostic device 1 according to Embodiment 1 of the present invention. The diagnostic device 1 diagnoses the suspicion of a disease whose symptoms appear in language. Embodiments 1 to 4 below use dementia as the example of such a disease, but the invention is not limited to dementia. For example, the present invention is also applicable to aphasia and developmental disorders (such as Asperger syndrome, learning disorders, and hyperactivity disorder). That is, in the present embodiment, a disease whose symptoms appear in language is any disease that impairs language for some reason, and includes, in addition to dementia, brain diseases and mental disorders (for example, depression) whose symptoms appear in language.

Here, dementia includes Alzheimer-type dementia, cerebrovascular dementia, dementia with Lewy bodies, frontotemporal dementia, and the like. In the present embodiment, dementia also includes MCI (Mild Cognitive Impairment).

In FIG. 1, the diagnostic device 1 includes a mobile terminal 100, a voice recognition server 200, and a diagnosis server 300. The mobile terminal 100 is a portable information processing device with a touch panel 102, such as a smartphone or a tablet terminal. This is only an example, and a mobile phone without the touch panel 102 may be used as the mobile terminal 100.

The voice recognition server 200 and the diagnosis server 300 are each a computer with a communication function. The mobile terminal 100, the voice recognition server 200, and the diagnosis server 300 are connected through a network NT so that they can communicate with one another. A public communication network, including a mobile phone network and the Internet, can be used as the network NT. The mobile terminal 100, the voice recognition server 200, and the diagnosis server 300 exchange various data using a communication protocol such as TCP/IP.

The mobile terminal 100 picks up the voice of the person to be diagnosed for dementia and presents the diagnosis result to that person. It includes a sound pickup unit 101, a touch panel 102, a control unit 103, and a communication unit 104.

The sound pickup unit 101 includes, for example, a microphone that picks up the voice uttered by the person and converts it into an audio signal, and a signal processing circuit that performs predetermined signal processing on that audio signal. The signal processing includes preprocessing such as removing noise from the audio signal, and converting the analog audio signal into digital voice data.

Under the control of the control unit 103, the touch panel 102 displays the diagnosis result transmitted from the diagnosis server 300 and displays screens containing messages that prompt the person to be diagnosed to speak.

The control unit 103 includes a CPU, a ROM, a RAM, and the like, and controls the mobile terminal 100 as a whole. In the present embodiment, the control unit 103 transmits the voice data output from the sound pickup unit 101 to the voice recognition server 200 through the communication unit 104, and displays the diagnosis result transmitted from the diagnosis server 300 on the touch panel 102.

The communication unit 104 is a communication device that connects the mobile terminal 100 to the network NT. In the present embodiment, under the control of the control unit 103, the communication unit 104 transmits voice data to the voice recognition server 200 and receives the diagnosis result transmitted from the diagnosis server 300.

The voice recognition server 200 converts the voice data transmitted from the mobile terminal 100 into text data, and includes a voice recognition unit 201 and a communication unit 202. The voice recognition unit 201 performs voice recognition processing on the voice data transmitted from the mobile terminal 100 and converts it into text data. Any known voice recognition processing may be used, for example one that uses an acoustic model storing sound waveform data and a language model storing information on words and their ordering.

The communication unit 202 is a communication device that connects the voice recognition server 200 to the network NT. It receives the voice data transmitted from the mobile terminal 100 and transmits the text data produced by the voice recognition unit 201 to the diagnosis server 300.

The diagnosis server 300 diagnoses whether a person is suspected of having dementia, and includes a compression unit 301, a compression rate calculation unit 302, a determination unit 303, and a communication unit 304.

The compression unit 301 losslessly compresses the text data transmitted from the voice recognition server 200. The compression unit 301 may use any known lossless compression method, such as ZIP, LZH, TARGZ, or CAB.

FIG. 2 is a diagram showing an example of compression processing by the compression unit 301. In general, lossless compression compresses data in two steps: a dictionary method and code assignment. In the dictionary step, words that appear repeatedly in the text data are encoded in order to reduce the redundancy of the text data being processed.

The text data TX shown in FIG. 2 is a transcription of what a dementia patient said. In the example of FIG. 2, uniquely identifiable codes are assigned to words that appear repeatedly in the text data TX, such as 「あのー」 ("um"), 「バイクの」 ("of the bike"), and 「とか」 ("and so on"). The text data TX is thereby converted into the text data TX1 shown in the middle of FIG. 2.

Next, the code assignment step is applied to the text data TX1. The code assignment step uses, for example, the Huffman method, which assigns shorter codes to words that appear more often. This finally yields the text data TX2. In this example, the 65-character, 130-byte text data TX is finally compressed into the 43-character, 86-byte text data TX2, so the compression rate is 86/130 = 66%.
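The code assignment step described above can be illustrated with a short Python sketch. It builds Huffman code lengths over a list of tokens so that more frequent tokens receive shorter codes. The function name `huffman_code_lengths` and the sample tokens are hypothetical, and practical compressors such as DEFLATE apply Huffman coding to byte-level symbols rather than whole words, so this is a simplified model of the step and not the patented implementation.

```python
import heapq
from collections import Counter
from itertools import count


def huffman_code_lengths(tokens):
    """Return {token: code length in bits} for a Huffman code over token counts."""
    freq = Counter(tokens)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct token still needs one bit per occurrence.
        return {next(iter(freq)): 1}
    tie = count()  # tie-breaker so the heap never has to compare dicts
    heap = [(n, next(tie), {tok: 0}) for tok, n in freq.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        # Merge the two least frequent subtrees; every token in them
        # moves one level deeper, i.e. its code grows by one bit.
        n1, _, d1 = heapq.heappop(heap)
        n2, _, d2 = heapq.heappop(heap)
        merged = {tok: depth + 1 for tok, depth in {**d1, **d2}.items()}
        heapq.heappush(heap, (n1 + n2, next(tie), merged))
    return heap[0][2]


# Hypothetical token stream with heavy repetition of a filler word.
sample = ["um"] * 5 + ["bike"] * 3 + ["road", "shop"]
print(huffman_code_lengths(sample))
```

In this sample the filler word that occurs most often ends up with the shortest code, which is why heavily repeated words shrink the encoded text.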

As the text data TX in FIG. 2 shows, a person suspected of dementia tends to use the same words repeatedly, so the compression rate of text data TX representing the speech of such a person is higher than that of a person not suspected of dementia. The compression rate can therefore be used to judge whether the person to be diagnosed is suspected of having dementia.

This tendency appears with any lossless compression method that uses a dictionary method and code assignment, so the present embodiment can use any existing lossless compression method built from those two steps. Such existing lossless compression methods achieve a higher compression rate the more often the same word is repeated, regardless of the content of the data being processed. The present embodiment therefore does not need to change the algorithm according to the language the speaker uses.
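The language independence described above can be tried out with a general-purpose compressor. The sketch below is an illustration only: it assumes Python's standard `zlib` module (DEFLATE, which combines dictionary-style matching with Huffman coding) as the compression method and UTF-8 as the text encoding, neither of which is specified in the source, and the sample utterances are invented. It reports compressed size divided by original size, so a smaller value means the text compressed more.

```python
import zlib


def compressed_fraction(text: str, encoding: str = "utf-8") -> float:
    """Compressed size divided by original size, both measured in bytes."""
    raw = text.encode(encoding)
    if not raw:
        raise ValueError("text must not be empty")
    return len(zlib.compress(raw, 9)) / len(raw)


# Invented sample utterances; the same function handles both languages.
repetitive_en = "um, the bike, um, the bike, and so on, " * 10
varied_en = (
    "Yesterday I rode my bike to the market, bought some fresh "
    "vegetables, met an old friend, and we talked about the weather "
    "before I cycled home along the river path at sunset."
)
repetitive_ja = "あのー、バイクの、あのー、バイクの、とか、" * 10
varied_ja = (
    "昨日は自転車で市場へ行き、新鮮な野菜を買ってから、"
    "古い友人に会って天気の話をし、夕方に川沿いの道を通って家に帰りました。"
)

for label, sample in [
    ("repetitive_en", repetitive_en),
    ("varied_en", varied_en),
    ("repetitive_ja", repetitive_ja),
    ("varied_ja", varied_ja),
]:
    print(f"{label}: {compressed_fraction(sample):.2f}")
```

No tokenizer or language-specific dictionary is involved: the compressor only sees bytes, which is the property the embodiment relies on.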

The present embodiment uses a lossless compression method because the algorithms of existing lossy compression methods are designed mainly for data other than text, such as image data, and are not suited to compressing text data.

Returning to FIG. 1, the compression rate calculation unit 302 calculates the compression rate of the text data compressed by the compression unit 301. The compression rate calculation unit 302 may calculate the compression rate as the ratio of the data amount of the compressed text data TX2 to the data amount of the uncompressed text data TX.
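As a small worked example of this calculation, the sketch below reproduces the 86/130 figure quoted for FIG. 2. The function name `compression_rate` is hypothetical, and the use of UTF-16-LE is an assumption made only to obtain 2 bytes per character, which is what the figure's byte counts imply; the source does not name the encoding. With this definition, a smaller value corresponds to stronger compression.

```python
def compression_rate(original_size: int, compressed_size: int) -> float:
    """Ratio of compressed data size to original data size (both in bytes)."""
    if original_size <= 0:
        raise ValueError("original_size must be positive")
    return compressed_size / original_size


# FIG. 2 numbers: 65 characters stored as 130 bytes, compressed to 86 bytes.
# Assumption: 2 bytes per character, illustrated here with UTF-16-LE.
original_size = len(("あ" * 65).encode("utf-16-le"))
rate = compression_rate(original_size, 86)
print(f"{original_size} bytes -> 86 bytes, rate = {rate:.0%}")
```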

If the compression rate calculated by the compression rate calculation unit 302 is larger than a predetermined threshold, the determination unit 303 determines that the person to be diagnosed is suspected of having dementia; otherwise, it determines that the person is not suspected of having dementia. The determination unit 303 may also calculate a degree of dementia whose value increases as the compression rate increases. For example, if the compression rate is at or below the threshold, the determination unit 303 may output dementia degree "0", indicating that dementia is not suspected. If the compression rate is larger than the threshold, the determination unit 303 may calculate the degree in steps, such as dementia degree "1" and dementia degree "2", as the compression rate increases. The determination unit 303 then generates data representing the determination as the diagnosis result. The threshold may be, for example, a value obtained from experiments on a large number of people, above which dementia can be judged to be suspected.
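The stepwise determination can be sketched as a lookup against ascending cutoffs. Everything specific here is an assumption for illustration: the name `dementia_degree`, the cutoff values 0.5 and 0.7, and the number of steps are not given in the source, which only says the threshold is obtained experimentally. The sketch also assumes the input is oriented the way the text describes the compression rate, that is, larger for more repetitive speech.

```python
from bisect import bisect_left
from typing import Sequence


def dementia_degree(rate: float, cutoffs: Sequence[float]) -> int:
    """Map a compression-rate score to a degree 0, 1, 2, ...

    `cutoffs` must be sorted in ascending order. cutoffs[0] plays the role
    of the threshold: a rate at or below it gives degree 0 (not suspected),
    and each further cutoff that the rate exceeds raises the degree by one.
    """
    return bisect_left(cutoffs, rate)


HYPOTHETICAL_CUTOFFS = [0.5, 0.7]  # placeholder values, not from the source

for rate in (0.40, 0.50, 0.60, 0.90):
    print(rate, dementia_degree(rate, HYPOTHETICAL_CUTOFFS))
```

Using `bisect_left` keeps a rate exactly equal to the threshold at degree "0", matching the "at or below the threshold" wording.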

The communication unit 304 is a communication device that connects the diagnosis server 300 to the network. It receives the text data transmitted from the voice recognition server 200 and transmits the diagnosis result generated by the determination unit 303 to the mobile terminal 100. On receiving the result, the mobile terminal 100 displays it on the touch panel 102.

In FIG. 1, the compression unit 301 through the determination unit 303 are realized, for example, by a CPU executing a program. In the present embodiment, the communication unit 304 corresponds to the acquisition unit in the claims.

FIG. 3 is a flowchart showing an example of processing of the diagnostic device 1 according to Embodiment 1 of the present invention. First, the mobile terminal 100 picks up the voice of the person to be diagnosed (hereinafter, the "diagnosis subject") and acquires voice data (S101). The diagnosis subject may speak, for example, by following a speech guidance message for diagnosing the suspicion of dementia that is displayed on the touch panel 102 of the mobile terminal 100. Alternatively, the diagnosis subject may speak in a conversation with a doctor or nurse (hereinafter, "doctor or the like").

Next, the mobile terminal 100 transmits the voice data of the diagnosis subject's speech to the voice recognition server 200 (S102).

Next, the voice recognition server 200 receives the voice data (S201), performs voice recognition on it to convert it into text data (S202), and transmits the text data to the diagnosis server 300 (S203).

Next, the diagnosis server 300 receives the text data transmitted from the voice recognition server 200 (S301), compresses the received text data (S302), and calculates the compression rate (S303).

Next, the diagnosis server 300 determines whether the compression rate is larger than a threshold X. If it is (YES in S304), the diagnosis server 300 determines that the diagnosis subject is suspected of having dementia (S305). If it is not (NO in S304), the diagnosis server 300 determines that the diagnosis subject is not suspected of having dementia (S306). The diagnosis server 300 then transmits the diagnosis result to the mobile terminal 100 (S307).
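The server-side steps S302 to S306 can be put together in a minimal sketch. Two assumptions are made that the source does not state. First, `zlib` stands in for the compression method. Second, because the text treats more repetitive speech as having a "higher" compression rate, the score compared with threshold X is taken to be the space saving, 1 minus compressed size over original size, so that it grows with repetition. The threshold value 0.6 and the names `redundancy_score` and `diagnose` are placeholders.

```python
import zlib

THRESHOLD_X = 0.6  # placeholder; the source obtains the threshold experimentally


def redundancy_score(text: str) -> float:
    """Space saving of lossless compression: 1 - compressed/original (bytes)."""
    raw = text.encode("utf-8")
    if not raw:
        raise ValueError("text must not be empty")
    return 1.0 - len(zlib.compress(raw, 9)) / len(raw)


def diagnose(text: str, threshold: float = THRESHOLD_X) -> dict:
    """Compress the text, score it, and compare against the threshold."""
    score = redundancy_score(text)   # S302 and S303
    suspected = score > threshold    # S304, leading to S305 or S306
    return {"score": score, "suspected": suspected}  # payload for S307


# Invented inputs standing in for recognized speech.
print(diagnose("um, the bike, " * 40))
print(diagnose(
    "Yesterday I rode my bike to the market, bought some fresh "
    "vegetables, met an old friend, and we talked about the weather "
    "before I cycled home along the river path at sunset."
))
```

Very short inputs are dominated by the compressor's fixed overhead, so a practical system would presumably need a minimum amount of speech before scoring; the experiment described later used roughly 13 to 17 minutes per subject.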

Next, the mobile terminal 100 receives the diagnosis result (S103) and displays it on the touch panel 102 (S104). The dementia diagnosis result is thereby presented to the diagnosis subject, the doctor or the like. In this case, the mobile terminal 100 may display on the touch panel 102 a message containing, for example, the degree of dementia described above and an explanation of that degree. For the explanation, a message such as "Normal." can be used for dementia degree "0", "Dementia is suspected." for dementia degree "1", and "Dementia is strongly suspected." for dementia degree "2".

Next, an experiment conducted to confirm the effect of the present embodiment is described. Various language indexes have conventionally been used to measure the language ability of dementia patients. This experiment compared the language index of the present embodiment, that is, the index based on the compression rate, with these existing language indexes in terms of how accurately each can diagnose dementia. The experiment involved 8 subjects known in advance to be suspected of dementia (AD; Alzheimer's disease) and 9 subjects known in advance not to be suspected of dementia (nonAD). The subjects were asked questions and spoke for about 13 to 17 minutes.

FIG. 4 is a table summarizing the language indexes used in the experiment of the present embodiment. In FIG. 4, "TOKEN" through "AWU" are conventionally used language indexes, and "TCR" is the language index of the present embodiment.

Specifically, "TOKEN" is the number of words the subject spoke. "TYPE" is the number of distinct words in the subject's speech. "TTR" is the ratio of "TYPE" to "TOKEN". The lower the "TTR", the more often the same words are repeated, and the stronger the suspicion of dementia.
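These three conventional indexes are straightforward to compute once the speech has been split into words. In the sketch below, the function name `lexical_indices` is hypothetical and whitespace splitting is only a placeholder tokenizer; Japanese text is not whitespace-delimited and would need a morphological analyzer, which is the language dependence the compression-based index is meant to avoid.

```python
from typing import Dict, Sequence, Union


def lexical_indices(tokens: Sequence[str]) -> Dict[str, Union[int, float]]:
    """Compute TOKEN (word count), TYPE (distinct words), and TTR = TYPE / TOKEN."""
    token_count = len(tokens)
    if token_count == 0:
        raise ValueError("tokens must not be empty")
    type_count = len(set(tokens))
    return {"TOKEN": token_count, "TYPE": type_count, "TTR": type_count / token_count}


# Invented utterance, tokenized by whitespace purely for illustration.
print(lexical_indices("um the bike um the bike um".split()))
```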

The remaining indexes are not described in detail here: "TPS" (Token Per Second) is the speech rate, "LEL" (Lexical Education Level) is the educational difficulty level of the vocabulary, "ADD" (Average Dependency Distance) is the average dependency distance, and "AWU" (Average Word User) is the average number of users per word.

"TCR" is the text compression rate (Text Compressibility Ratio), written here with a subscript such as ZIP or LZH that indicates the compression method used. This experiment used ZIP, LZH, TARGZ, and CAB as the compression methods, so the corresponding TCR values are written TCR_ZIP, TCR_LZH, TCR_TARGZ, and TCR_CAB.
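Computing one index per compression method can be sketched with the compressors available in Python's standard library. These are stand-ins chosen for convenience and are not the tools used in the experiment: `zlib` implements DEFLATE, the algorithm used by ZIP and gzip, while `bz2` and `lzma` are unrelated formats included only to show the per-method structure; LZH and CAB have no standard-library implementation. The name `tcr_by_method` and the sample texts are hypothetical, and each value is compressed size over original size.

```python
import bz2
import lzma
import zlib
from typing import Callable, Dict

# Stand-in compressors keyed by a label used as the TCR subscript.
COMPRESSORS: Dict[str, Callable[[bytes], bytes]] = {
    "zlib": lambda data: zlib.compress(data, 9),
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}


def tcr_by_method(text: str) -> Dict[str, float]:
    """Return compressed size / original size for each stand-in compressor."""
    raw = text.encode("utf-8")
    if not raw:
        raise ValueError("text must not be empty")
    return {name: len(fn(raw)) / len(raw) for name, fn in COMPRESSORS.items()}


repetitive = "um, the bike, um, the bike, and so on, " * 15
varied = (
    "Yesterday I rode my bike to the market, bought some fresh "
    "vegetables, met an old friend, and we talked about the weather "
    "before I cycled home along the river path at sunset."
)
print("repetitive:", tcr_by_method(repetitive))
print("varied:    ", tcr_by_method(varied))
```

The absolute values differ between compressors because of container overhead, so thresholds would have to be set separately for each method.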

FIG. 5 is a table summarizing the experimental results of the present embodiment. In Table 51 of FIG. 5, the first column from the left lists the language index used, the second column "AD" gives the values for subjects suspected of dementia, and the third column "nonAD" gives the values for healthy subjects not suspected of dementia.

As shown in Table 51, the "AD" subjects had larger values for both "TIME" and "TOKEN", "17.40" and "1225", than the "nonAD" subjects, "13.27" and "814". In addition, "TTR" was "0.249" for the "AD" subjects, lower than the "0.313" of the "nonAD" subjects, which shows that the number of "TYPE" is small relative to the number of "TOKEN". The values in parentheses in Table 51 are variances.

In Table 51, the p-value in the fourth column is a statistical index of how likely the observed difference between the "AD" and "nonAD" groups is to arise by chance. For example, a p-value of 0.01 (p = 0.01) means that, if there were no real difference between the groups, a difference this large would arise by chance only about once in 100 times. That is, the smaller the p-value, the more significant the difference between the groups, which in this experiment means that the index separates the "AD" and "nonAD" subjects more reliably. In general, a difference between groups is interpreted as significant when the p-value is below 0.05.

Table 52 of FIG. 5 summarizes the breakdown of the subjects; the first column shows the "AD" subjects and the second column shows the "nonAD" subjects. In Table 52, the "AD" subjects were 1 man and 7 women, with a mean age of "77.2" years and a mean MMSE score of "17.0". The MMSE score is a subject's score on a dementia diagnostic test: a score above 21 is diagnosed as "nonAD", and a score of 21 or below as "AD".

In Table 52, the "nonAD" subjects were 4 men and 5 women, with a mean age of "76.6" years and a mean MMSE score of "25.1".

As shown in Table 51, TCR_ZIP, TCR_LZH, TCR_TARGZ, and TCR_CAB have p-values of "0.054", "0.015", "0.060", and "0.029", respectively. All except TCR_TARGZ have p-values below 0.05, which shows that they distinguish the "AD" and "nonAD" subjects accurately. Although the p-value of TCR_TARGZ exceeds 0.05, it does so only slightly, so TCR_TARGZ can also be said to distinguish "AD" from "nonAD" almost accurately.

The p-value of "TTR", which has conventionally been used as a linguistic index useful for diagnosing dementia, is "0.02", which is also a significant result. However, TTR is known to depend strongly on sample size, and when the sample size is small it becomes difficult to obtain a p-value as significant as the one in this experiment.
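The dependence of TTR on sample size can be seen with a small Python sketch. The speaker model here is hypothetical and deliberately artificial: a fixed vocabulary of 50 words that is simply cycled through. The helper name `ttr` is also an assumption for this example.

```python
def ttr(tokens):
    """Type-token ratio: distinct words divided by total words."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


# Hypothetical speaker who cycles through a fixed 50-word vocabulary.
vocabulary = [f"word{i}" for i in range(50)]
tokens = [vocabulary[i % 50] for i in range(1000)]

# The vocabulary never changes, yet TTR falls as the sample grows.
for n in (50, 100, 500, 1000):
    print(n, ttr(tokens[:n]))
```

Because the value changes with the length of the sample alone, TTR figures taken from samples of different sizes are hard to compare directly.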

As described above, the diagnostic device 1 diagnoses dementia using the compression rate, so it can accurately diagnose the presence or absence of dementia regardless of the language. In addition, because the method used by the diagnostic device 1 does not depend on the linguistic meaning of the utterances, the p-value does not increase and the diagnostic accuracy does not deteriorate when the number of samples is small, as happens with TTR.

(Embodiment 2)
The diagnostic device of Embodiment 2 is characterized in that it is implemented on a local computer without going through a network. FIG. 6 is a block diagram showing the overall configuration of a diagnostic device 1A according to Embodiment 2 of the present invention. In the present embodiment, components identical to those of Embodiment 1 are given the same reference signs, and their description is omitted.

The diagnostic device 1A includes a sound collection unit 410, a processing unit 420, and a display unit 412. The sound collection unit 410 is, for example, an IC recorder externally connected to the computer constituting the diagnostic device 1A, or a microphone built into that computer, and collects a person's voice.

The processing unit 420 and the display unit 412 are provided by the personal computer (PC) constituting the diagnostic device 1A. This PC may be either a desktop PC or a notebook PC.

The processing unit 420 includes an acquisition unit 421, a voice recognition unit 422, a compression unit 423, a compression rate calculation unit 424, and a determination unit 425. These blocks are realized by a CPU executing a diagnostic program that causes the computer to function as the diagnostic device 1A.

The acquisition unit 421 acquires the voice data collected by the sound collection unit 410. The voice recognition unit 422, the compression unit 423, the compression rate calculation unit 424, and the determination unit 425 have the same functions as the voice recognition unit 201, the compression unit 301, the compression rate calculation unit 302, and the determination unit 303 of FIG. 1, respectively, so their detailed description is omitted.
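The division of the processing unit 420 into blocks can be sketched in Python as follows. This is only an illustrative outline under several assumptions: the class and method names are hypothetical, the recognizer is a stub supplied by the caller, `zlib` stands in for the compression method, the compression rate is taken as 1 minus (compressed size / original size), and the threshold value is arbitrary.

```python
import zlib
from dataclasses import dataclass
from typing import Callable


@dataclass
class ProcessingUnit:
    recognize: Callable[[bytes], str]  # stand-in for voice recognition unit 422
    threshold: float                   # threshold X; the value used is hypothetical

    def compress(self, text: str) -> bytes:
        # Role of compression unit 423: lossless compression (zlib as a stand-in).
        return zlib.compress(text.encode("utf-8"))

    def compression_rate(self, text: str, compressed: bytes) -> float:
        # Role of compression rate calculation unit 424 (assumed definition).
        raw_len = len(text.encode("utf-8"))
        return 1.0 - len(compressed) / raw_len if raw_len else 0.0

    def determine(self, rate: float) -> bool:
        # Role of determination unit 425: suspicion when the rate exceeds the threshold.
        return rate > self.threshold

    def diagnose(self, voice_data: bytes) -> bool:
        # Role of acquisition unit 421: receive the collected voice data.
        text = self.recognize(voice_data)
        return self.determine(self.compression_rate(text, self.compress(text)))


# Stub recognizer (assumption): treats the "voice data" as UTF-8 text bytes.
unit = ProcessingUnit(recognize=lambda data: data.decode("utf-8"), threshold=0.5)
print(unit.diagnose(("again and again " * 50).encode("utf-8")))
```

Passing the recognizer in as a function keeps the compression and determination steps testable without a real speech recognition engine.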

The display unit 412 is a display device such as a liquid crystal display or an organic EL display, and displays a screen with the same content as the touch panel 102 shown in FIG. 1.

FIG. 7 is a flowchart showing an example of the processing of the diagnostic device 1A according to Embodiment 2 of the present invention. First, the sound collection unit 410 collects the voice of the person to be diagnosed and acquires voice data (S801). Next, the voice recognition unit 422 converts the acquired voice data into text data by voice recognition (S802).

The processing of S803 to S807 is the same as that of S302 to S306 of FIG. 3, respectively. In S808, the determination unit 425 causes the display unit 412 to display the diagnosis result.

As described above, because the diagnostic device 1A according to Embodiment 2 is implemented on a local computer, the diagnosis result can be obtained promptly, independent of network communication traffic.

(Embodiment 3)
The diagnostic device of Embodiment 3 is characterized in that the sound collection unit of the diagnostic device 1 of Embodiment 1 is implemented by an IC recorder. FIG. 8 is a diagram showing the overall configuration of a diagnostic device 1B according to Embodiment 3 of the present invention. In the present embodiment, components identical to those of Embodiments 1 and 2 are given the same reference signs, and their description is omitted.

The diagnostic device 1B differs from the diagnostic device 1 in that a PC (personal computer) 920 is used in place of the mobile terminal 100 and an IC recorder 940 is used in place of the sound collection unit 101.

The PC 920 is a desktop PC or a notebook PC and includes a display unit 921, a communication unit 922, and a USB interface 923.

The display unit 921 is a display device such as a liquid crystal display or an organic EL display, and displays a screen with the same content as the touch panel 102 shown in FIG. 1.

The communication unit 922 is a communication device for connecting the PC 920 to the network NT. The USB interface 923 acquires voice data from a USB memory 930. Like the sound collection unit 101 shown in FIG. 1, the IC recorder 940 acquires voice data by collecting the voice uttered by the person to be diagnosed, and records the voice data in the USB memory 930.

FIG. 9 is a flowchart showing an example of the processing of the diagnostic device 1B according to Embodiment 3 of the present invention. First, in S901, the PC 920 reads from the USB memory 930 the voice data of the person to be diagnosed that was collected by the IC recorder 940, and thereby acquires the voice data. Thereafter, the same processing as in FIG. 3 is performed in FIG. 9. That is, S902 to S904 of FIG. 9 perform the same processing as S102 to S104 of FIG. 3, S911 to S913 of FIG. 9 perform the same processing as S201 to S203 of FIG. 3, and S921 to S927 of FIG. 9 perform the same processing as S301 to S307 of FIG. 3.

As described above, because the diagnostic device 1B according to Embodiment 3 uses the externally connected IC recorder 940, the IC recorder 940 can be placed near the mouth of the person to be diagnosed, and that person's utterances can be collected accurately.

(Embodiment 4)
The diagnostic device according to Embodiment 4 is characterized in that it diagnoses dementia using text data without performing voice recognition. FIG. 10 is a block diagram showing the overall configuration of a diagnostic device 1C according to Embodiment 4 of the present invention. In the present embodiment, components identical to those of Embodiments 1 to 3 are given the same reference signs, and their description is omitted.

The diagnostic device 1C differs from the diagnostic device 1B in that a scanner 910 is newly provided and in that the USB memory 930, the IC recorder 940, and the voice recognition server 200 are omitted.

The scanner 910 optically reads a document in which the utterances of the person to be diagnosed have been transcribed, or a document written by the person to be diagnosed, performs text recognition on the scanned document to generate text data, and outputs the text data to the PC 920. Here, the document in which the utterances have been transcribed may be, for example, a document in which a doctor or the like has written down on paper what the person to be diagnosed said in a dialogue with the doctor or the like, or in response to a message displayed on the display unit 921.

The document written by the person to be diagnosed may be, for example, a document that the person wrote directly on paper in response to a message displayed on the display unit 921 or in the course of a dialogue with a doctor or the like.

The USB interface 923 acquires the text data generated by the scanner 910.

FIG. 11 is a flowchart showing an example of the processing of the diagnostic device 1C according to Embodiment 4 of the present invention. First, in S1101, the PC 920 acquires text data from the scanner 910. Next, the communication unit 922 transmits the acquired text data to the diagnosis server 300 via the network NT (S1102). The processing of S1111 to S1117 of FIG. 11 is the same as that of S301 to S307 of FIG. 3, and the processing of S1103 and S1104 of FIG. 11 is the same as that of S103 and S104 of FIG. 3.

As described above, because the diagnostic device 1C according to Embodiment 4 acquires text data using the scanner 910, the text data can be obtained without voice recognition, and the processing load on the system as a whole can be reduced.

X Threshold
1, 1A, 1B, 1C Diagnostic device
100 Mobile terminal
101, 410 Sound collection unit
102 Touch panel
103 Control unit
104, 202, 304, 922 Communication unit
200 Voice recognition server
201, 422 Voice recognition unit
300 Diagnosis server
301, 423 Compression unit
302, 424 Compression rate calculation unit
303 Determination unit
412 Display unit
420 Processing unit
421 Acquisition unit
425 Determination unit
910 Scanner
920 PC
921 Display unit
923 USB interface
930 USB memory
940 IC recorder

Claims

1. A diagnostic device for diagnosing a disease whose symptoms appear in language, the diagnostic device comprising:
an acquisition unit that acquires text data indicating the utterance content of a person or a document written by the person;
a compression unit that losslessly compresses the text data acquired by the acquisition unit by a data compression method in which the compression rate increases as the number of repetitions of the same word increases;
a compression rate calculation unit that calculates the compression rate of the text data losslessly compressed by the compression unit; and
a determination unit that determines whether or not the person is suspected of having the disease based on the compression rate calculated by the compression rate calculation unit.

2. The diagnostic device according to claim 1, further comprising:
a sound collection unit that collects the voice uttered by the person; and
a voice recognition unit that converts the collected voice into the text data by voice recognition and outputs the text data to the acquisition unit.

3. The diagnostic device according to claim 1, wherein the acquisition unit acquires the text data by performing text recognition on a document in which the utterance content of the person is written down or a document written by the person.

4. The diagnostic device according to claim 1, wherein the determination unit determines that the person has the disease when the compression rate is higher than a predetermined threshold value.

5. The diagnostic device according to claim 1, wherein the determination unit determines that the degree of the disease is higher as the compression rate is higher.

6. The diagnostic device according to claim 1, wherein the disease includes dementia, a brain disease, and a mental disorder.

7. A method for controlling a diagnostic device for diagnosing a disease whose symptoms appear in language, the method comprising, by the diagnostic device:
acquiring text data indicating the utterance content of a person or a document written by the person;
losslessly compressing the acquired text data by a data compression method in which the compression rate increases as the number of repetitions of the same word increases;
calculating the compression rate of the losslessly compressed text data; and
comparing the calculated compression rate with a predetermined threshold value.

8. A diagnostic program for diagnosing a disease whose symptoms appear in language,
the diagnostic program causing a computer to function as each unit included in the diagnostic device according to claim 1.