JP2007226388A

JP2007226388A - Command input device and program

Info

Publication number: JP2007226388A
Application number: JP2006045008A
Authority: JP
Inventors: Hidesato Fukuoka; 秀悟福岡
Original assignee: Konica Minolta Medical and Graphic Inc
Current assignee: Konica Minolta Medical and Graphic Inc
Priority date: 2006-02-22
Filing date: 2006-02-22
Publication date: 2007-09-06

Abstract

<P>PROBLEM TO BE SOLVED: To accurately and quickly perform the speech input of a command name for issuing the instruction command of specific processing relating to a medical image. <P>SOLUTION: When the speech data of a speech input from a microphone are output from an A/D converter, a CPU performs the speech recognition of the speech data, and converts the speech data into character information, and stores the character information in a storage part 24 as an input character string 240. Then, when a back mode is selected as a speech input mode, a back command name whose number of characters is shorter than that of a standard command name stored in a command table 242 and the input character string 240 are compared. The CPU executes the instruction command of the standard command name associated with the matched back command name among the back command names of the command table 242. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、医用画像に関する特定処理の実行の指示命令を行うコマンド名を入力するコマンド入力装置及びプログラムに関する。 The present invention relates to a command input device and a program for inputting a command name for instructing execution of a specific process relating to a medical image.

従来から、ＣＴ（Computed Tomography）やＣＲ（Computed Radiography）、ＭＲＩ（Magnetic Resonance Imaging）、乳房撮影装置、超音波／内視鏡診断装置等といった医用画像生成装置（以下、「モダリティ」という。）によって撮影・生成された医用画像を、イメージャやビューア等の出力装置に転送する医用画像転送装置が知られている。 Conventionally, medical image generation apparatuses (hereinafter referred to as “modalities”) such as CT (Computed Tomography), CR (Computed Radiography), MRI (Magnetic Resonance Imaging), mammography apparatus, ultrasonic / endoscopic diagnosis apparatus, and the like. 2. Description of the Related Art There are known medical image transfer apparatuses that transfer medical images that have been taken and generated to an output device such as an imager or a viewer.

この医用画像転送装置は、モダリティから出力される医用画像のデータ信号（例えば、ビデオ信号やデジタルデータ）を取り込んで、例えば、ＤＩＣＯＭ（Digital Imaging and Communications in Medicine）規格に基づいて静止画像データや動画像データ（以下、これらのデータを「画像データ」と総称する。）に変換する。 This medical image transfer apparatus takes in a medical image data signal (for example, a video signal or digital data) output from a modality, for example, still image data or a moving image based on the DICOM (Digital Imaging and Communications in Medicine) standard. It is converted into image data (hereinafter, these data are collectively referred to as “image data”).

そして、その画像データに関する各種機能として、イメージャやサーバ等の外部機器に転送出力する機能、画像データを一時的に内部のメモリに記憶する機能、画像データを再生出力する機能等を実行する。撮影技師等のユーザは、これらの機能の実行を、医用画像転送装置に設けられたキースイッチやフットスイッチ等の操作することで指示する。 As various functions related to the image data, a function of transferring and outputting to an external device such as an imager or a server, a function of temporarily storing the image data in an internal memory, a function of reproducing and outputting the image data, and the like are executed. A user such as a radiographer instructs execution of these functions by operating a key switch or a foot switch provided in the medical image transfer apparatus.

例えば、ユーザが、超音波診断装置のトランスデューサを患者の腹部に当てながら超音波映像を観察し、当該超音波映像の静止画をプリント出力するとした場合には、キースイッチを押下して出力装置への転送を指示する。このとき、ユーザは、モダリティと医用画像転送装置の両方を操作しなければならない。また、医用画像転送装置の操作のために視線をモダリティから外す可能性があるため診断の妨げになる共に、様々な機器に接触することは医療衛生上にも問題である。 For example, when the user observes an ultrasound image while placing the transducer of the ultrasound diagnostic apparatus on the patient's abdomen and prints out a still image of the ultrasound image, the user presses the key switch to the output device. Instruct to transfer. At this time, the user must operate both the modality and the medical image transfer apparatus. Further, since the line of sight may be removed from the modality for the operation of the medical image transfer apparatus, the diagnosis is hindered, and contact with various devices is also a problem in medical hygiene.

このため、医用画像に関する特定処理の指示命令を行うコマンド名の音声入力により行う技術が考案され、例えば、次のような技術が知られている。即ち、マイクから音声入力したコマンド名（音声データ）が、予め登録されているコマンド名（音声データ）と完全一致しなかった場合に、音声入力されたコマンド名と類似しているものを抽出して、その抽出したコマンド名がユーザの音声入力であったか否かの確認を求める技術が知られている（例えば、特許文献１及び２参照）。
特開２０００−５１５８号公報特開２００４−２６７６３４号公報 For this reason, a technique has been devised by voice input of a command name for instructing a specific process related to a medical image. For example, the following technique is known. That is, if the command name (voice data) input by voice from the microphone does not completely match the command name (voice data) registered in advance, a command similar to the command name input by voice is extracted. In addition, there is known a technique for confirming whether or not the extracted command name is a voice input by a user (see, for example, Patent Documents 1 and 2).
JP 2000-5158 A JP 2004-267634 A

しかし、医療の現場は、人の往来が激しいと共に医療機器から発せられる音声や放送音等が多いため、外部環境のノイズの影響を受けて音声認識の認識率が低下してしまう可能性がある。このため、特許文献１及び２の技術において、音声認識に失敗した場合には、類似するとして抽出したコマンド名（音声データ）に、ユーザが発声したコマンド名が含まれず、音声入力による指示命令ができなくなる。 However, in the medical field, there is a possibility that the recognition rate of voice recognition may decrease due to the influence of noise from the external environment because there is a lot of traffic and many voices and broadcast sounds emitted from medical devices. . For this reason, in the techniques of Patent Documents 1 and 2, when speech recognition fails, the command name (speech data) extracted as similar does not include the command name uttered by the user, and an instruction command by voice input is issued. become unable.

この場合、ユーザは、意図するコマンド名が抽出されるまでコマンド名の発声を繰り返さなければならず、医用画像転送装置への指示命令に時間がかかってしまう。特に、医用画像の取り込みは、患者に負担を与えないためにも、的確且つ迅速に行うことが望ましいが、医用画像転送装置への指示命令に時間がかってしまうと、患者への負担に与えることとなってしまった。 In this case, the user must repeat the utterance of the command name until the intended command name is extracted, and the instruction command to the medical image transfer apparatus takes time. In particular, it is desirable to capture a medical image accurately and quickly so as not to burden the patient. However, if it takes a long time to instruct the medical image transfer apparatus, it may impose a burden on the patient. It has become.

本発明は、上述したような課題に鑑みて為されたものであり、その目的とすることころは、医用画像に関する特定処理の指示命令を行うコマンド名の音声入力を、確実且つ迅速に行えるようにすることである。 The present invention has been made in view of the above-described problems, and an object of the present invention is to enable reliable and quick voice input of a command name for instructing a specific process related to a medical image. Is to do.

以上の課題を解決するために、請求項１に記載のコマンド入力装置は、
音声入力手段と、
前記音声入力手段により入力された音声の音声認識を行って文字情報に変換する音声認識手段と、
医用画像に関する特定処理の実行の指示命令を行うコマンド名と、当該コマンド名より字数が短い短縮コマンド名とを対応づけて記憶する記憶手段と、
前記音声認識手段により変換された文字情報と、前記記憶手段に記憶された短縮コマンド名とを比較する比較手段と、
前記比較手段による比較の結果が一致した場合に、当該比較した短縮コマンド名に対応づけられた前記コマンド名の指示命令を実行するコマンド実行手段と、
を備えることを特徴としている。 In order to solve the above problems, a command input device according to claim 1 is provided:
Voice input means;
Voice recognition means for performing voice recognition of the voice input by the voice input means and converting it into character information;
Storage means for storing a command name for performing an instruction to execute a specific process relating to a medical image and a shortened command name having a shorter number of characters than the command name;
Comparing means for comparing the character information converted by the voice recognition means with the shortened command name stored in the storage means;
Command execution means for executing an instruction instruction of the command name associated with the compared shortened command name when the comparison result by the comparison means matches;
It is characterized by having.

請求項２に記載の発明は、請求項１に記載の発明において、
第１及び第２の音声入力モードの何れかを選択する選択手段を更に備え、
前記比較手段は、
前記選択手段により第１の音声入力モードが選択された場合、前記音声認識手段により変換された文字情報と前記コマンド名とを比較し、
前記第２の音声入力モードが選択された場合、前記音声認識手段により変換された文字情報と前記短縮コマンド名とを比較することを特徴としている。 The invention according to claim 2 is the invention according to claim 1,
And further comprising selection means for selecting one of the first and second voice input modes,
The comparison means includes
When the first voice input mode is selected by the selection means, the character information converted by the voice recognition means is compared with the command name,
When the second voice input mode is selected, the character information converted by the voice recognition means is compared with the abbreviated command name.

請求項３に記載の発明は、請求項２に記載の発明において、
前記選択手段は、
前記音声認識手段により変換された文字情報に基づいて前記第１及び第２の音声入力モードの何れかを選択することを特徴としている。 The invention according to claim 3 is the invention according to claim 2,
The selection means includes
One of the first and second voice input modes is selected based on the character information converted by the voice recognition means.

請求項４に記載のコマンド入力装置は、
音声入力手段と、
前記音声入力手段により入力された音声の音声認識を行って文字情報に変換する音声認識手段と、
医用画像に関する特定処理の実行の指示命令を行うコマンド名を複数記憶する記憶手段と、
前記記憶手段に記憶されたコマンド名の中から、前記音声認識手段により変換された文字情報に類似するコマンド名を抽出する抽出手段と、
前記コマンド名よりも字数が短い識別文字情報を、前記抽出手段により抽出されたコマンド毎に対応づけて一覧表示する一覧表示手段と、
前記音声認識手段により変換された文字情報と前記一覧表示された識別文字情報とを比較する比較手段と、
前記比較手段による比較の結果が一致した場合に、当該比較した識別文字情報に対応づけて表示された前記コマンド名の指示命令を実行するコマンド実行手段と、
を備えることを特徴としている。 The command input device according to claim 4,
Voice input means;
Voice recognition means for performing voice recognition of the voice input by the voice input means and converting it into character information;
Storage means for storing a plurality of command names for instructing execution of specific processing relating to medical images;
Extracting means for extracting a command name similar to the character information converted by the voice recognition means from the command names stored in the storage means;
A list display unit that displays a list of identification character information having a shorter number of characters than the command name in association with each command extracted by the extraction unit;
Comparison means for comparing the character information converted by the voice recognition means and the identification character information displayed in the list;
Command execution means for executing an instruction command of the command name displayed in association with the compared identification character information when the comparison result by the comparison means matches;
It is characterized by having.

請求項５に記載の発明は、請求項４に記載の発明において、
前記抽出手段は、
前記記憶手段に記憶されたコマンド名の中から、前記変換された文字情報と先頭一致するコマンド名を抽出することを特徴としている。 The invention according to claim 5 is the invention according to claim 4,
The extraction means includes
A command name matching the head of the converted character information is extracted from the command names stored in the storage means.

請求項６に記載の発明は、請求項４又は５に記載の発明において、
前記一覧表示手段は、
前記抽出手段により抽出されたコマンド名をユーザの使用頻度の高い順に一覧表示する頻度順表示手段を有することを特徴としている。 The invention according to claim 6 is the invention according to claim 4 or 5,
The list display means includes:
It is characterized by having a frequency order display means for displaying a list of command names extracted by the extraction means in descending order of user use frequency.

請求項７に記載の発明は、請求項４〜６の何れか一項に記載の発明において、
前記一覧表示手段は、
前記抽出手段により抽出されたコマンド名のうち、前記コマンド実行手段が直前に実行した指示命令のコマンド名と関連性のあるコマンド名を一覧表示する関連順一覧表示手段を有することを特徴としている。 The invention according to claim 7 is the invention according to any one of claims 4 to 6,
The list display means includes:
Among the command names extracted by the extracting means, there is provided a related order list display means for displaying a list of command names related to the command name of the instruction command executed immediately before by the command execution means.

請求項８に記載の発明は、請求項１〜７の何れか一項に記載の発明において、
医用画像に関する特定処理の実行は、外部機器への転送、記憶部への書き込み及び表示部への表示の少なくとも何れかを含むことを特徴としている。 The invention according to claim 8 is the invention according to any one of claims 1 to 7,
The execution of the specific processing relating to the medical image includes at least one of transfer to an external device, writing to a storage unit, and display on a display unit.

請求項９に記載のプログラムは、コンピュータを、
音声入力手段により入力された音声の音声認識を行って文字情報に変換する音声認識手段、
医用画像に関する特定処理の実行の指示命令を行うコマンド名と、当該コマンド名より字数が短い短縮コマンド名とを対応づけて記憶する記憶手段、
前記音声認識手段により変換された文字情報と、前記記憶手段に記憶された短縮コマンド名とを比較する比較手段、
前記比較手段による比較の結果が一致した場合に、当該比較した短縮コマンド名に対応づけられた前記コマンド名の指示命令を実行するコマンド実行手段、
として機能させることを特徴としている。 The program according to claim 9 is a computer,
Speech recognition means for performing speech recognition of speech input by the speech input means and converting it into character information;
Storage means for storing a command name for performing an instruction to execute a specific process relating to a medical image and a shortened command name having a shorter number of characters than the command name;
Comparison means for comparing the character information converted by the voice recognition means with the shortened command name stored in the storage means;
Command execution means for executing an instruction instruction of the command name associated with the compared short command name when the result of comparison by the comparison means matches;
It is characterized by making it function as.

請求項１０に記載のプログラムは、コンピュータを、
音声入力手段により入力された音声の音声認識を行って文字情報に変換する音声認識手段、
医用画像に関する特定処理の実行の指示命令を行うコマンド名を複数記憶する記憶手段、
前記記憶手段に記憶されたコマンド名の中から、前記音声認識手段により変換された文字情報に類似するコマンド名を抽出する抽出手段、
前記抽出手段により抽出されたコマンド名毎に識別文字情報を対応づけて一覧表示する一覧表示手段、
前記音声認識手段により変換された文字情報と前記一覧表示された識別文字情報とを比較する比較手段、
前記比較手段による比較の結果が一致した場合に、当該比較した識別文字情報に対応づけて表示された前記コマンド名の指示命令を実行するコマンド実行手段、
として機能させることを特徴としている。 The program according to claim 10 is a computer,
Speech recognition means for performing speech recognition of speech input by the speech input means and converting it into character information;
Storage means for storing a plurality of command names for instructing execution of specific processing relating to medical images;
Extraction means for extracting a command name similar to the character information converted by the voice recognition means from the command names stored in the storage means;
List display means for displaying a list in association with identification character information for each command name extracted by the extraction means;
Comparison means for comparing the character information converted by the voice recognition means with the identification character information displayed in the list;
Command execution means for executing an instruction command of the command name displayed in association with the compared identification character information when the result of comparison by the comparison means matches;
It is characterized by making it function as.

請求項１及び９に記載の発明によれば、音声認識して変換した文字情報と、短縮コマンド名とを比較して一致した場合に、その短縮コマンド名に対応付けられたコマンド名の指示命令を実行する。一般に、文字情報同士の比較は、その比較する字数が短い方が精度が向上する。このため、文字情報との比較は、コマンド名よりも短縮コマンド名の方が精度を高められる。従って、医用画像に関する特定処理の指示命令を行うコマンド名の音声入力を、短縮コマンド名によってより確実に行えるようにすることができる。 According to the first and ninth aspects of the present invention, when the character information converted by speech recognition and the abbreviated command name are compared and matched, a command name instruction command associated with the abbreviated command name is provided. Execute. In general, the accuracy of comparing character information is improved when the number of characters to be compared is shorter. For this reason, in comparison with character information, the accuracy of a shortened command name is higher than that of a command name. Therefore, the voice input of the command name for performing the instruction command for the specific process relating to the medical image can be more reliably performed by the shortened command name.

請求項２に記載の発明によれば、請求項１に記載の発明と同様の効果が得られるのは無論のこと、第１の音声入力モードが選択された場合は、文字情報とコマンド名とを比較し、第２の音声入力モードが選択された場合は、当該文字情報と裏コマンド名とを比較する。一般に、音声認識は、認識対象とする音声が短い程その認識率が高まる。このため、ユーザが予め第２の音声入力モードを選択して裏コマンド名を発声することで、より確実にコマンド名の指示命令を実行させることができる。 According to the second aspect of the present invention, the same effect as that of the first aspect of the invention can be obtained. When the first voice input mode is selected, the character information, the command name, If the second voice input mode is selected, the character information is compared with the reverse command name. In general, in speech recognition, the recognition rate increases as the speech to be recognized becomes shorter. For this reason, when the user selects the second voice input mode in advance and utters the back command name, the command name instruction command can be executed more reliably.

請求項３に記載の発明によれば、請求項１に記載の発明と同様の効果が得られるのは無論のこと、第１及び第２の音声入力モードの何れかを、音声認識により変換された文字情報に基づいて選択する。このため、ユーザは、音声入力モードの選択をコマンド入力装置に接触することなく行うことができる。 According to the third aspect of the invention, it is possible to obtain the same effect as the first aspect of the invention, and any one of the first and second voice input modes is converted by voice recognition. Select based on the character information. For this reason, the user can select the voice input mode without touching the command input device.

請求項４及び１０に記載の発明によれば、音声認識を行って変換した文字情報に類似するコマンド名を識別文字情報と共に一覧表示する。そして、更に音声認識を行って変換した文字情報が表示した識別文字情報と一致した場合、当該識別文字情報に対応付けられたコマンド名の指示命令を実行する。このため、音声認識に失敗したとしても、音声入力に類似するコマンド名が一覧表示されるため、ユーザは、そのコマンド名の中から所望のコマンド名を選択して、コマンド名の指示命令を実行させることができる。 According to invention of Claim 4 and 10, the command name similar to the character information converted by performing speech recognition is displayed in a list with the identification character information. When the character information converted by further voice recognition matches the displayed identification character information, an instruction command for the command name associated with the identification character information is executed. For this reason, even if voice recognition fails, a list of command names similar to voice input is displayed, so the user selects a desired command name from the command names and executes a command name instruction command Can be made.

また、識別文字情報は、コマンド名よりも字数が短いため、ユーザが識別文字情報を音声入力した場合には、音声認識の認識率が高まると共に、識別文字情報との比較の精度も向上する。従って、医用画像に関する特定処理の指示命令を行うコマンド名の音声入力を、識別文字情報の音声入力によりより確実且つ迅速に行えるようにすることができる。 Further, since the identification character information has a shorter number of characters than the command name, when the user inputs the identification character information by voice, the recognition rate of voice recognition is increased and the accuracy of comparison with the identification character information is also improved. Therefore, the voice input of the command name for performing the instruction command for the specific process regarding the medical image can be performed more reliably and quickly by the voice input of the identification character information.

請求項５に記載の発明によれば、請求項４に記載の発明と同様の効果が得られるのは無論のこと、音声認識により変換した文字情報と先頭一致するコマンド名を抽出して表示する。このため、例えば、ユーザが、所望のコマンド名の先頭部分を音声入力することで、その先頭部分で始まるコマンド名が一覧表示される。従って、ユーザは、一覧表示されたコマンド名の中から所望のコマンド名を音声入力により選択できるため、音声認識に失敗した場合のように繰り返し音声入力する手間が省け、コマンド名の音声入力を迅速に行うことができる。 According to the fifth aspect of the invention, it is possible to obtain the same effect as the fourth aspect of the invention, and extract and display the command name that matches the character information converted by voice recognition. . For this reason, for example, when the user inputs the head portion of a desired command name by voice, a list of command names starting with the head portion is displayed. Therefore, since the user can select a desired command name from the command names displayed in a list by voice input, the user can save time and effort to repeatedly input voice as in the case of voice recognition failure, and promptly input voice of a command name. Can be done.

請求項６又は７に記載の発明によれば、請求項４又は５に記載の発明と同様の効果が得られるのは無論のこと、抽出したコマンド名をユーザの使用頻度の高い順に一覧表示してもよいし、直前に実行した指示命令のコマンド名と関連性のあるコマンド名を一覧表示することとしてもよい。これにより、ユーザは、その一覧表示されたコマンド名に従って、実際の使用状況に即したコマンド名の音声入力ができる。 According to the invention described in claim 6 or 7, it is obvious that the same effect as that of the invention described in claim 4 or 5 can be obtained, and the extracted command names are displayed in a list in order of frequency of use by the user. Alternatively, a list of command names related to the command name of the instruction command executed immediately before may be displayed. As a result, the user can input a command name in accordance with the actual usage status according to the command names displayed in the list.

請求項８に記載の発明によれば、請求項１〜７の何れか一項に記載の発明と同様の効果が得られるのは無論のこと、医用画像の外部機器への転送、記憶部への書き込み及び表示部への表示の少なくとも何れかをコマンド名の音声入力に従って実行する。 According to the invention described in claim 8, it is possible to obtain the same effect as that of any one of the inventions described in claims 1-7, transfer of medical images to an external device, and storage Is written and displayed on the display unit according to the voice input of the command name.

以下、本発明のコマンド入力装置を医用画像転送装置（以下、単に「転送装置」という。）に適用し、当該転送装置を有する医用画像出力システムの実施形態について、図１〜図１０を参照して詳細に説明する。 Hereinafter, a command input device of the present invention is applied to a medical image transfer device (hereinafter simply referred to as “transfer device”), and an embodiment of a medical image output system having the transfer device is described with reference to FIGS. Will be described in detail.

〔システム構成〕
先ず、医用画像出力システムＳのシステム構成について説明する。図１は、医用画像出力システムＳのシステム構成の一例を示す図である。図１によれば、医用画像出力システムＳは、複数のモダリティＭそれぞれに接続された転送装置１と、出力装置９としてのイメージャ３、サーバ５及びカラープリンタ７とが通信ネットワークＮを介して接続されて構成されている。〔System configuration〕
First, the system configuration of the medical image output system S will be described. FIG. 1 is a diagram illustrating an example of a system configuration of the medical image output system S. As illustrated in FIG. According to FIG. 1, a medical image output system S includes a transfer device 1 connected to each of a plurality of modalities M, an imager 3 as an output device 9, a server 5, and a color printer 7 connected via a communication network N. Has been configured.

モダリティＭは、Ｘ線撮影装置や超音波診断装置、内視鏡診断装置、ＣＴ等であり、撮影・生成した医用画像のデータ信号を転送装置１に出力する。このモダリティＭには、撮影した医用画像をＤＩＣＯＭ規格に従ったデジタルデータに変換して出力するものと、当該医用画像をビデオ信号やデジタルデータのデータ信号で出力するものとがある。 The modality M is an X-ray imaging apparatus, an ultrasonic diagnostic apparatus, an endoscopic diagnostic apparatus, a CT, or the like, and outputs a data signal of a medical image captured and generated to the transfer apparatus 1. This modality M includes one that converts a captured medical image into digital data according to the DICOM standard and outputs it, and another that outputs the medical image as a video signal or a data signal of digital data.

イメージャ３は、転送装置１から転送されたＤＩＣＯＭ規格に準拠した画像データに基づいて、熱感光フィルム上に医用画像の画像形成を行って出力する。熱感光フィルムは、ＰＥＴ（ポリエチレンテレフタレート）等の支持体上に、感光性及び感熱性の感光材料を含有する乳剤が塗布されて、感光層が形成されたものである。 The imager 3 forms a medical image on a thermosensitive film based on the image data that conforms to the DICOM standard transferred from the transfer device 1 and outputs the medical image. The heat-sensitive film is a film in which a photosensitive layer is formed by coating an emulsion containing a photosensitive and heat-sensitive photosensitive material on a support such as PET (polyethylene terephthalate).

サーバ５は、表示装置や大容量の記憶装置等を備えた一般的なコンピュータにより構成され、転送装置１から転送された画像データを記憶装置に蓄積記憶（ストレージ）したり、当該画像データに基づいて医用画像を表示装置に表示出力したりするビューワとして機能する。カラープリンタ７は、レーザープリンタ等により構成され、転送装置１から転送された画像データに基づいて記録紙上に医用画像を画像形成して出力・排紙する。 The server 5 is configured by a general computer including a display device, a large-capacity storage device, and the like, and stores and stores image data transferred from the transfer device 1 in the storage device or based on the image data. It functions as a viewer that displays and outputs medical images on a display device. The color printer 7 is constituted by a laser printer or the like, and forms a medical image on a recording sheet based on the image data transferred from the transfer device 1 and outputs / discharges the medical image.

転送装置１は、モダリティＭから出力された医用画像のデータ信号を、ユーザの指示命令に従って選択された出力装置９に応じたデータ形式の画像データに変換し、当該画像データを通信ネットワークＮを介して転送する。また、転送装置１は、変換した画像データをユーザの指示命令に従って内部のメモリに一時的に記憶したり、当該画像データをモダリティＭに再生表示させたりする。これらの画像データに関する様々な機能を、医用画像に関する特定処理という。 The transfer device 1 converts the data signal of the medical image output from the modality M into image data in a data format corresponding to the output device 9 selected according to the user's instruction command, and the image data is transmitted via the communication network N. Forward. Further, the transfer device 1 temporarily stores the converted image data in an internal memory in accordance with a user instruction command, and causes the modality M to reproduce and display the image data. These various functions relating to image data are referred to as specific processing relating to medical images.

これらの医用画像の出力装置９への転送、転送装置１の内部メモリへの一時的な記憶、再生表示等の各種機能に関する指示命令は、コマンド名として転送装置１のディスプレイ１５０に表示される。 Instruction commands relating to various functions such as transfer of these medical images to the output device 9, temporary storage in the internal memory of the transfer device 1, and playback display are displayed on the display 150 of the transfer device 1 as command names.

ユーザは、ディスプレイ１５０に表示されたコマンド名の中から所望のコマンド名をキースイッチ１２０やフットスイッチ（図示略）等を押下操作することで選択して、転送装置１に当該コマンド名に対応する指示命令を行う。また、本実施形態においては、イヤーセットマイク（以下、単に「マイク」という。）１５にコマンド名を発声することで、転送装置１に当該コマンド名に対応する指示命令を行うことができる。 The user selects a desired command name from among the command names displayed on the display 150 by depressing the key switch 120, a foot switch (not shown), or the like, and corresponds the command name to the transfer apparatus 1. Directs instructions. In the present embodiment, by issuing a command name to the earset microphone (hereinafter simply referred to as “microphone”) 15, an instruction command corresponding to the command name can be issued to the transfer apparatus 1.

〔転送装置の構成〕
次に、転送装置１の機能構成について図２を参照して説明する。図２は、転送装置１の機能構成の一例を示すブロック図である。図２によれば、転送装置１は、ＣＰＵ（Central Processing Unit）１０と、音声入力部１２と、操作入力部１６と、表示部１９を制御するディスプレイＩ／Ｆ１８と、ビデオアンプ２１によって増幅されたデータ信号を復号するデコーダ２０と、通信部２２と、プログラムメモリ２４と、画像メモリ２６と、記憶部２８とがシステムバス３０に接続されて構成される。 [Configuration of transfer device]
Next, the functional configuration of the transfer apparatus 1 will be described with reference to FIG. FIG. 2 is a block diagram illustrating an example of a functional configuration of the transfer apparatus 1. According to FIG. 2, the transfer device 1 is amplified by a CPU (Central Processing Unit) 10, an audio input unit 12, an operation input unit 16, a display I / F 18 that controls the display unit 19, and a video amplifier 21. A decoder 20 that decodes the data signal, a communication unit 22, a program memory 24, an image memory 26, and a storage unit 28 are connected to a system bus 30.

ＣＰＵ１０は、各機能部の動作の制御と、機能部間のデータの入出力の制御等を行うことで転送装置１を統括的に管理・制御する制御部である。具体的には、操作入力部１６から入力される操作信号に応じてプログラムメモリ２４に格納されたプログラムを読み出し、当該プログラムに従った処理を実行する。そして、その処理結果に基づいて表示部１９の表示内容の更新や画像データの転送、記憶部２８への記憶等を行う。 The CPU 10 is a control unit that comprehensively manages and controls the transfer device 1 by controlling the operation of each functional unit and controlling input / output of data between the functional units. Specifically, a program stored in the program memory 24 is read according to an operation signal input from the operation input unit 16, and processing according to the program is executed. Then, based on the processing result, the display content of the display unit 19 is updated, image data is transferred, stored in the storage unit 28, and the like.

また、ＣＰＵ１０は、音声認識機能１１を有する。音声認識機能１１は、音声入力部１２から入力される音声データに音声認識処理を施して当該音声データを文字列に変換する機能であり、ＨＨＭ（Hidden Markov model；隠れマルコフモデル）等の公知技術を適宜採用可能である。 Further, the CPU 10 has a voice recognition function 11. The voice recognition function 11 is a function that performs voice recognition processing on voice data input from the voice input unit 12 and converts the voice data into a character string, and is a known technique such as HHM (Hidden Markov model). Can be adopted as appropriate.

音声認識機能１１の動作原理としては、公知技術であるため詳細な説明は省略するが、簡単に説明すると次のようになる。先ず、入力されたデジタルの音声データにＭＦＣＣ（Mel Frequency Cepstral Coefficients ）等による音声の特徴分析を行い、音声区間を検出する。そして、その検出結果に基づいて音声データの区間毎に、認識辞書と比較してパターン認識を行って、音声データを文字列に変換する。 Since the operation principle of the voice recognition function 11 is a known technique, a detailed description thereof will be omitted, but a brief description will be as follows. First, a voice feature analysis is performed on the input digital voice data by MFCC (Mel Frequency Cepstral Coefficients) or the like to detect a voice section. Based on the detection result, pattern recognition is performed for each section of the voice data in comparison with the recognition dictionary, and the voice data is converted into a character string.

ＣＰＵ１０は、Ａ／Ｄ変換器１３を介して入力された音声データに音声認識を施して文字列に変換し、その文字列がコマンド名と一致した場合は、そのコマンド名に対応する指示命令に従った処理を行う。このため、ユーザは、操作入力部１６の押下の代わりに、コマンド名を発声してマイク１５から音声入力することで、転送装置１に対する指示命令を行って、当該転送装置１を操作することができるようになる。 The CPU 10 performs voice recognition on the voice data input via the A / D converter 13 and converts the voice data into a character string. If the character string matches the command name, the CPU 10 generates an instruction command corresponding to the command name. Follow the process. For this reason, instead of pressing the operation input unit 16, the user can issue an instruction command to the transfer device 1 by operating the transfer device 1 by uttering a command name and inputting the voice from the microphone 15. become able to.

音声入力部１２は、マイク端子Ｔ１に着脱可能なマイク１５と、アンプ１４と、Ａ／Ｄ変換器１３と備えて構成される。アンプ１４は、マイク端子Ｔ１に接続されたマイク１５から入力された音声信号を増幅してＡ／Ｄ変換器１３に出力する。Ａ／Ｄ変換器１３は、アンプ１４によって増幅された音声信号をＡ／Ｄ変換して音声データとしてＣＰＵ１０に出力する。 The audio input unit 12 includes a microphone 15 that can be attached to and detached from the microphone terminal T1, an amplifier 14, and an A / D converter 13. The amplifier 14 amplifies the audio signal input from the microphone 15 connected to the microphone terminal T 1 and outputs the amplified audio signal to the A / D converter 13. The A / D converter 13 performs A / D conversion on the audio signal amplified by the amplifier 14 and outputs it to the CPU 10 as audio data.

操作入力部１６は、カーソルキーやテンキー等のキースイッチ１２０と、フットスイッチ等を備えて構成され、押下されたキースイッチの操作信号をＣＰＵ１０に出力する。 The operation input unit 16 includes a key switch 120 such as a cursor key or a numeric keypad, a foot switch, and the like, and outputs an operation signal of the pressed key switch to the CPU 10.

ディスプレイＩ／Ｆ１８は、ＲＧＢインターフェイスやＮＴＳＣインターフェイス等により構成され、ＣＰＵ１０の制御に基づいて表示部１９の表示素子のＯＮ／ＯＦＦを制御する。表示部１９は、図１に示すディスプレイ１５０に相当し、ＣＲＴ（Cathode-ray Tube）やＬＣＤ（Liquid Crystal Display）等により構成される。表示部１９は、ディスプレイＩ／Ｆ１８を介して入力されたＣＰＵ１０の制御に基づいた表示画面の表示や、画像データの再生表示を行う。 The display I / F 18 is configured by an RGB interface, an NTSC interface, or the like, and controls ON / OFF of the display element of the display unit 19 based on the control of the CPU 10. The display unit 19 corresponds to the display 150 shown in FIG. 1 and is configured by a CRT (Cathode-ray Tube), an LCD (Liquid Crystal Display), or the like. The display unit 19 displays a display screen based on the control of the CPU 10 input via the display I / F 18 and reproduces and displays image data.

尚、画像データの再生表示は、モダリティＭが有する表示部に対して行うこととしてもよく、この場合は、画像データを例えば、ＮＴＳＣ形式のビデオ信号に変換するエンコーダと、モダリティＭにビデオ信号を出力するビデオ出力端子とを設けることで実現可能である。 The image data may be reproduced and displayed on the display unit of the modality M. In this case, an encoder that converts the image data into, for example, an NTSC format video signal, and the video signal to the modality M are displayed. This can be realized by providing a video output terminal for output.

ビデオアンプ２１は、ＮＴＳＣ（National Television Standards Committee）／ＰＡＬ（Phase Alternation by Line）コンポジットビデオ信号やＹ／Ｃコンポーネントビデオ信号、ＲＧＢセパレートビデオ信号等の入力が可能なビデオ端子Ｔ３に接続されたモダリティＭから出力されるビデオ信号を増幅してデコーダ２０に出力する。デコーダ２０は、ビデオアンプ２１によって増幅されたビデオ信号をＣＰＵ１０の制御に従って、所定のデータ形式に復号して画像データを生成する。尚、復号するデータ形式としては、ＤＩＣＯＭ形式やＪＰＥＧ形式、ＭＰＥＧ形式等があり、転送する出力装置９に応じて選択される。また、ビデオアンプ２１やデコーダ２０を介することなくＲＳ４２２形式でデジタルデータを直接入力可能なモダリティＭの場合、ビデオアンプ２１やデコーダ２０の代わりに、ＲＳ４２２レシーバを設けてもよい。 The video amplifier 21 is a modality M connected to a video terminal T3 capable of inputting a NTSC (National Television Standards Committee) / PAL (Phase Alternation by Line) composite video signal, a Y / C component video signal, an RGB separate video signal, or the like. The video signal output from is amplified and output to the decoder 20. The decoder 20 decodes the video signal amplified by the video amplifier 21 into a predetermined data format under the control of the CPU 10 to generate image data. The data format to be decoded includes DICOM format, JPEG format, MPEG format, and the like, and is selected according to the output device 9 to be transferred. In the case of the modality M that can directly input digital data in the RS422 format without going through the video amplifier 21 or the decoder 20, an RS422 receiver may be provided instead of the video amplifier 21 or the decoder 20.

通信部２２は、ＬＡＮインターフェイス等により構成され、ネットワーク端子Ｔ５を介して通信ネットワークＮに接続されて、当該通信ネットワークＮを介して出力装置９のイメージャ３やサーバ５、カラープリンタ７とデータ通信する機能部である。 The communication unit 22 is configured by a LAN interface or the like, connected to the communication network N via the network terminal T5, and performs data communication with the imager 3, the server 5, and the color printer 7 of the output device 9 via the communication network N. It is a functional part.

プログラムメモリ２４は、ＲＯＭ（Read Only Memory）やフラッシュＲＯＭ等によって構成され、ＣＰＵ１０が実行する初期プログラムやアプリケーションプログラム等の各種プログラムを記憶するメモリ領域である。画像メモリ２６は、ＶＲＡＭ（Video RAM）等の揮発性メモリにより構成されて、出力装置９に転送する画像データを一時的に記憶するメモリ領域である。 The program memory 24 is configured by a ROM (Read Only Memory), a flash ROM, or the like, and is a memory area that stores various programs such as an initial program and application programs executed by the CPU 10. The image memory 26 is configured by a volatile memory such as a VRAM (Video RAM), and is a memory area that temporarily stores image data to be transferred to the output device 9.

記憶部２８は、ＨＤＤ（Hard Disk Drive）や半導体メモリ等を備えて構成され、ＣＰＵ１０が実行するプログラムに係るデータや、デコーダ２０によって変換された画像データ等を記憶する不揮発性の記憶領域である。 The storage unit 28 includes an HDD (Hard Disk Drive), a semiconductor memory, and the like, and is a non-volatile storage area that stores data related to a program executed by the CPU 10, image data converted by the decoder 20, and the like. .

〔第１実施形態〕
次に、転送装置１の第１実施形態について図３〜図５を参照して説明する。第１実施形態における転送装置１は、コマンド名の音声入力モードとして、標準モードと裏モードとが設けられる。 [First Embodiment]
Next, a first embodiment of the transfer device 1 will be described with reference to FIGS. The transfer apparatus 1 according to the first embodiment is provided with a standard mode and a back mode as command name voice input modes.

標準モードは、一つ又は複数の単語で表される標準コマンド名で転送装置１に対する指示命令を音声入力するモードである。標準コマンド名としては、例えば、画像データを記憶部２８に記憶させる指示命令としての「キロク」、表示部１９に再生表示させる指示命令としての「サイセイ」、出力装置９に転送させる指示命令としての「テンソウ」等がある。 The standard mode is a mode in which an instruction command for the transfer apparatus 1 is input by voice with a standard command name represented by one or a plurality of words. Standard command names include, for example, “Kiroku” as an instruction command for storing image data in the storage unit 28, “Saisei” as an instruction command for reproduction and display on the display unit 19, and an instruction command for transfer to the output device 9. There is "Tenso".

裏モードは、標準コマンド名よりも短い字数の文字列又は数字の裏コマンド名で転送装置１に対する指示命令を音声入力するモードである。裏コマンド名としては、例えば、標準コマンド名の「キロク」に対応する「ケー」、「サイセイ」に対応する「エス」、「テンソウ」に対応する「ティー」等がある。 The reverse mode is a mode in which an instruction command to the transfer apparatus 1 is input by voice using a character string having a shorter number of characters or a numeric reverse command name than the standard command name. Examples of the back command name include “K” corresponding to the standard command name “Kiroku”, “S” corresponding to “Saisei”, “Tee” corresponding to “Tenso”, and the like.

ユーザは、操作入力部１６のキースイッチ１２０又はフットスイッチを押下することによって、標準モード及び裏モードの何れかを選択して、その選択した音声入力モードに応じたコマンド名を発声することで、転送装置１に対する指示命令を行う。 The user selects either the standard mode or the back mode by pressing the key switch 120 or the foot switch of the operation input unit 16, and utters a command name corresponding to the selected voice input mode. An instruction command is given to the transfer device 1.

図３（ａ）は、第１実施形態における記憶部２８のデータ構成の一例を示す図である。図３（ａ）によれば、記憶部２８は、入力文字列２４０と、コマンドテーブル２４２とを記憶している。入力文字列２４０は、ＣＰＵ１０が音声データに音声認識処理を施して取得した文字列である。 FIG. 3A is a diagram illustrating an example of a data configuration of the storage unit 28 in the first embodiment. According to FIG. 3A, the storage unit 28 stores an input character string 240 and a command table 242. The input character string 240 is a character string acquired by the CPU 10 performing voice recognition processing on the voice data.

コマンドテーブル２４２は、図３（ｂ）に示すように、標準コマンド名と裏コマンド名とを番号ｎ順に対応づけて記憶するデータテーブルである。例えば、コマンドテーブルの先頭（ｎ＝１）には、標準コマンド名「キロク」と裏コマンド名「ケー」とが対応づけられて記憶されている。 As shown in FIG. 3B, the command table 242 is a data table that stores standard command names and back command names in association with each other in the order of number n. For example, at the head (n = 1) of the command table, the standard command name “KIROK” and the reverse command name “K” are stored in association with each other.

ＣＰＵ１０は、音声入力モードとして標準モードが選択されている場合には、入力文字列２４０と標準コマンド名とを比較し、裏モードが選択されている場合には、裏コマンド名と比較する。そして、その比較の結果、一致した標準コマンド名又は裏コマンド名の指示命令が音声入力された判定して対応する処理を実行する。 The CPU 10 compares the input character string 240 with the standard command name when the standard mode is selected as the voice input mode, and compares with the back command name when the back mode is selected. Then, as a result of the comparison, it is determined that the instruction command for the matched standard command name or back command name is inputted by voice, and the corresponding processing is executed.

一般に、音声認識の対象とする音声データが短いほうがその認識率が高くなる。このため、予め音声入力モードとして裏モードを選択して、ユーザが裏コマンド名を音声入力することで、その音声認識の認識率を向上させることができる。また、文字情報同士の比較は、比較の対象となる文字列の長さが短い程その精度が高くなる。このため、標準コマンド名よりも短いの裏コマンド名で比較を行うことにより、入力文字列２４０との比較の精度が高くなる。 In general, the recognition rate increases as the speech data to be speech-recognized is shorter. For this reason, when the reverse mode is selected as the voice input mode in advance and the user inputs the reverse command name by voice, the recognition rate of the voice recognition can be improved. Further, the accuracy of comparing character information increases as the length of the character string to be compared becomes shorter. For this reason, the comparison with the input character string 240 is enhanced by performing the comparison with the back command name shorter than the standard command name.

次に、図４のフローチャートと、図５の表示画面例とを参照して転送装置１の具体的な動作について説明する。先ず、ＣＰＵ１０は、操作入力部１６から出力された操作信号に基づいて、ユーザが選択した音声入力モードを判定する（ステップＡ１）。そして、標準モードが選択されたと判定した場合は（ステップＡ１；標準モード）、ステップＡ３〜Ａ２１の処理を行い、裏モードが選択されたと判定した場合は（ステップＡ１；裏モード）、ステップＡ２３〜Ａ４１の処理を行う。 Next, a specific operation of the transfer apparatus 1 will be described with reference to the flowchart of FIG. 4 and the display screen example of FIG. First, the CPU 10 determines the voice input mode selected by the user based on the operation signal output from the operation input unit 16 (step A1). If it is determined that the standard mode has been selected (step A1; standard mode), the processes of steps A3 to A21 are performed. If it is determined that the reverse mode has been selected (step A1; back mode), steps A23 to Process A41 is performed.

ＣＰＵ１０は、標準モードが選択されたと判定した場合は、図５（ａ）のようなコマンド入力画面１９０を表示部１９に表示させて、マイク１５からの音声入力によってＡ／Ｄ変換器１３からの音声データの入力を待機する。そして、Ａ／Ｄ変換器１３から入力された音声データに音声認識処理を施して（ステップＡ３）、音声データを文字列に変換して入力文字列２４０として記憶部２８に記憶する（ステップＡ５）。 When determining that the standard mode has been selected, the CPU 10 displays a command input screen 190 as shown in FIG. 5A on the display unit 19, and receives a voice input from the microphone 15 from the A / D converter 13. Wait for input of audio data. Then, voice recognition processing is performed on the voice data input from the A / D converter 13 (step A3), and the voice data is converted into a character string and stored in the storage unit 28 as the input character string 240 (step A5). .

次いで、ＣＰＵ１０は、コマンドテーブル２４２の先頭（ｎ＝１）の標準コマンド名をコマンド候補として選択し（ステップＡ７）、そのコマンド候補と入力文字列２４０とを比較する（ステップＡ１１）。ＣＰＵ１０は、この比較の結果、一致したと判定した場合は（ステップＡ１１；Ｙｅｓ）、コマンド候補の指示命令（コマンド）に対応する処理を実行する（ステップＡ１７）。 Next, the CPU 10 selects the standard command name at the head (n = 1) of the command table 242 as a command candidate (step A7), and compares the command candidate with the input character string 240 (step A11). As a result of this comparison, if the CPU 10 determines that they match (step A11; Yes), the CPU 10 executes processing corresponding to the command candidate instruction command (command) (step A17).

また、コマンド候補と入力文字列２４０とが一致しないと判定した場合（ステップＡ１１；Ｎｏ）、コマンド候補として選択している標準コマンド名がコマンドテーブル２４２の最後尾でなければ（ステップＡ１３；Ｎｏ）、次（ｎ＝ｎ＋１）の標準コマンド名をコマンド候補として順次選択する（ステップＡ１５）。そして、ステップＡ１９の処理に移行して、入力文字列２４０とコマンド候補とを比較する。 If it is determined that the command candidate and the input character string 240 do not match (step A11; No), the standard command name selected as the command candidate is not the tail of the command table 242 (step A13; No). The next (n = n + 1) standard command names are sequentially selected as command candidates (step A15). Then, the process proceeds to step A19, and the input character string 240 is compared with the command candidate.

例えば、図５（ａ）のコマンド入力画面１９０において「キロク」という音声データが音声入力された場合には、入力文字列２４０としての「キロク」とコマンドテーブル２４２の標準コマンド名とを比較していく。図３（ｂ）のコマンドテーブルにおいては、先頭の標準コマンド名と入力文字列２４０とが一致すると判定して、モダリティＭから取得した画像データの記憶部２８への記憶（記録）を開始し、図５（ｂ）のメッセージ画面１９２を表示する。 For example, when voice data “Kiroku” is inputted by voice on the command input screen 190 in FIG. 5A, “Kirok” as the input character string 240 is compared with the standard command name in the command table 242. Go. In the command table of FIG. 3B, it is determined that the leading standard command name matches the input character string 240, and storage (recording) of the image data acquired from the modality M into the storage unit 28 is started. The message screen 192 shown in FIG. 5B is displayed.

ＣＰＵ１０は、ステップＡ１３において、コマンド候補として選択している標準コマンド名がコマンドテーブル２４２の最後尾であると判定した場合は（ステップＡ１３；Ｙｅｓ）、再度音声入力を依頼するメッセージ（例えば、「もう一度コマンドを入力して下さい」）を表示部１９に表示して（ステップＡ１９）、ステップＡ２１に移行する。 If the CPU 10 determines in step A13 that the standard command name selected as the command candidate is the tail of the command table 242 (step A13; Yes), the CPU 10 requests a voice input again (for example, “one more time” Enter command ") is displayed on the display unit 19 (step A19), and the process proceeds to step A21.

ＣＰＵ１０は、ステップＡ１７及びＡ１９の処理後、ユーザにより例えば、キースイッチ１２０の終了キーが押下されたと判定した場合は（ステップＡ２１；Ｙｅｓ）、標準モードによる音声入力を終了し、当該終了キーが押下されずに継続すると判定した場合は（ステップＡ２１；Ｎｏ）、ステップＡ３に移行して、ステップＡ３〜Ａ１９の処理を繰り返す。 For example, when the CPU 10 determines that the end key of the key switch 120 has been pressed after the processing of steps A17 and A19 (step A21; Yes), the CPU 10 ends the voice input in the standard mode and presses the end key. If it is determined that the process is to be continued (step A21; No), the process proceeds to step A3, and the processes of steps A3 to A19 are repeated.

一方、ＣＰＵ１０は、ステップＡ１において、裏モードが選択されたと判定した場合は、図５（ｃ）のコマンド入力画面１９４を表示すると共に、Ａ／Ｄ変換器１３から出力された音声データに音声認識処理を施し（ステップＡ２３）、その認識結果から入力文字列２４０を記憶部２８に記憶する（ステップＡ２５）。ＣＰＵ１０は、コマンドテーブル２４２の先頭（ｎ＝１）の裏コマンド名をコマンド候補として選択し（ステップＡ２７）、そのコマンド候補と入力文字列２４０とを比較する（ステップＡ３１）。 On the other hand, if the CPU 10 determines in step A1 that the reverse mode has been selected, the CPU 10 displays the command input screen 194 in FIG. 5C and recognizes the voice data output from the A / D converter 13 as voice recognition. Processing is performed (step A23), and the input character string 240 is stored in the storage unit 28 from the recognition result (step A25). The CPU 10 selects the back command name at the head (n = 1) of the command table 242 as a command candidate (step A27), and compares the command candidate with the input character string 240 (step A31).

そして、ステップＡ３１における比較の結果、一致したと判定した場合は（ステップＡ３１；Ｙｅｓ）、コマンド候補として選択した裏コマンド名の指示命令に対応する処理を実行する（ステップＡ３７）。ＣＰＵ１０は、コマンド実行後、ステップＡ２１と同様に終了キーが押下されたと判定した場合は（ステップＡ４１；Ｙｅｓ）、裏モードによる音声入力を終了し、継続すると判定した場合は（ステップＡ４１；Ｎｏ）、ステップＡ２３に移行して、ステップＡ２３〜Ａ３９の処理を繰り返す。 If it is determined as a result of comparison in step A31 (step A31; Yes), processing corresponding to the instruction command for the reverse command name selected as the command candidate is executed (step A37). When the CPU 10 determines that the end key has been pressed after the command execution (step A41; Yes), the CPU 10 ends the voice input in the back mode and continues (step A41; No). Then, the process proceeds to step A23, and the processes of steps A23 to A39 are repeated.

尚、ステップＡ３１において選択した裏コマンド名（コマンド候補）と入力文字列２４０とが一致しなかった場合に行う処理（ステップＡ３３，Ａ３５，Ａ３９）は、標準モードにおけるステップＡ１３，Ａ１５，Ａ１９と同様であるためその説明は省略する。 The processing (steps A33, A35, A39) performed when the back command name (command candidate) selected in step A31 does not match the input character string 240 is the same as steps A13, A15, A19 in the standard mode. Therefore, the description thereof is omitted.

例えば、音声認識を行って「ケー」という入力文字列２４０を取得した場合、この入力文字列２４０「ケー」とコマンドテーブル２４２の裏コマンド名とを比較していく。図３（ｂ）のコマンドテーブルにおいては、先頭の裏コマンド名と入力文字列２４０とが一致すると判定する。そして、その裏コマンド名に対応づけられた標準コマンド名の「キロク」の指示命令、即ち、モダリティＭから取得した画像データの記憶部２８への記憶（記録）を開始し、図５（ｂ）のメッセージ画面１９２を表示する。 For example, when the input character string 240 “K” is acquired by performing voice recognition, the input character string 240 “K” is compared with the reverse command name of the command table 242. In the command table of FIG. 3B, it is determined that the leading back command name and the input character string 240 match. Then, the instruction command of “Kiroku” of the standard command name associated with the back command name, that is, the storage (recording) of the image data acquired from the modality M to the storage unit 28 is started, and FIG. Message screen 192 is displayed.

このように、裏モードにおいては標準モードのコマンド名「キロク」よりも字数が小さい「ケー」という音声入力によって、画像データの記憶部２８への記憶を転送装置１に指示することができる。 Thus, in the reverse mode, the transfer device 1 can be instructed to store the image data in the storage unit 28 by a voice input of “K”, which has a smaller number of characters than the command name “KIROK” in the standard mode.

以上、第１実施形態によれば、標準コマンド名よりも短い字数の裏コマンド名を当該標準コマンド名に対応づけて記憶して、裏モードにおいて音声入力された音声が裏コマンド名と一致した場合は、その裏コマンド名に対応する標準コマンド名の指示命令を実行する。これにより、標準コマンド名よりも短い音声入力で転送装置１を操作できるようになる。 As described above, according to the first embodiment, the back command name having a shorter number of characters than the standard command name is stored in association with the standard command name, and the voice input in the back mode matches the back command name. Executes the instruction command of the standard command name corresponding to the reverse command name. As a result, the transfer apparatus 1 can be operated with a voice input shorter than the standard command name.

このため、医療の現場の環境ノイズやユーザの声色の変化等の影響等を受けにくくすることができ、転送装置１に対する指示命令の音声入力をより確実に行うことができる。また、音声入力により転送装置１を操作できるため、医用画像の取り込みにおける操作が容易になると共に、ユーザの転送装置１に対する接触が減るため衛生面も改善される。 For this reason, it can be made hard to receive the influence of the environmental noise of a medical field, a user's voice color change, etc., and the voice input of the instruction command with respect to the transfer apparatus 1 can be performed more reliably. In addition, since the transfer device 1 can be operated by voice input, an operation for capturing a medical image is facilitated, and a user's contact with the transfer device 1 is reduced, so that hygiene is improved.

尚、上述した第１実施形態において、標準モードと裏モードとの選択を操作入力部１６の操作によって行うこととしたが、この音声入力モードの選択を音声入力によって行うこととしてもよい。より具体的には、標準モードを選択するためのキーワード（例えば、「ヒョウジュン」）と、裏モードを選択するためのキーワード（例えば、「ウラ」）とを予め設定しておき、これらのキーワードが音声入力されたことによって標準モードと裏モードとを切り替えることとしてもよい。これにより、ユーザは、転送装置１の操作入力部１６に接触することなく音声入力モードを選択することができる。 In the first embodiment described above, the selection between the standard mode and the back mode is performed by operating the operation input unit 16, but the selection of the voice input mode may be performed by voice input. More specifically, a keyword for selecting the standard mode (for example, “Hyojun”) and a keyword for selecting the back mode (for example, “Ura”) are set in advance, and these keywords are set. It is also possible to switch between the standard mode and the back mode when voice is input. Thereby, the user can select the voice input mode without touching the operation input unit 16 of the transfer apparatus 1.

また、コマンド名を音声認識した後に、そのコマンド名の指示命令を実行することとして説明したが、そのコマンド名を一旦ユーザに確認してから実行することとしてもよい。例えば、標準モードが選択され、ステップＡ１１においてコマンド候補と入力文字列２４０とが一致すると判定した場合に、図５（ｄ）のコマンド確認画面１９６を表示部１９に表示する。そして、「ハイ」という音声データが音声入力された場合には、コマンド名の指示命令を実行し、「イイエ」という音声データが音声入力された場合は、ステップＡ１３に移行して、コマンドテーブル２４２からコマンド候補を選択して比較していく。 Further, although it has been described that the command name is instructed to be executed after voice recognition of the command name, the command name may be once confirmed with the user and executed. For example, when the standard mode is selected and it is determined in step A11 that the command candidate matches the input character string 240, the command confirmation screen 196 of FIG. When voice data “high” is inputted by voice, a command name instruction command is executed. When voice data “yes” is inputted by voice, the process proceeds to step A13 and the command table 242 is executed. Select command candidates from and compare them.

また、裏モードが選択された場合には、図５（ｅ）のコマンド確認画面１９８を表示部１９に表示する。そして、「ワイ（イエス）」という音声データが音声入力された場合に、コマンド名の指示命令を実行し、標準モードよりも字数の「エヌ（ノー）」という音声データが音声入力された場合に、ステップＡ３３に移行する。 When the back mode is selected, the command confirmation screen 198 shown in FIG. Then, when voice data “wai (yes)” is inputted by voice, a command name instruction command is executed, and when voice data “n (no)” having the number of characters is inputted by voice compared to the standard mode. The process proceeds to step A33.

このように、コマンド確認画面１９６及び１９８において音声認識したコマンド名を確認することで、誤った指示命令の実行を防止することができると共に、裏モードにおいては、その確認時の音声入力の認識率を高めることができる。 In this way, by confirming the command name that has been voice-recognized on the command confirmation screens 196 and 198, execution of an erroneous instruction command can be prevented, and in the back mode, the recognition rate of voice input at the time of the confirmation Can be increased.

また、裏コマンド名を、標準コマンド名をアルファベット表記した場合の頭文字に設定することとしたが、例えば、図５（ｆ）のようなコマンド入力画面２００を表示して、「イチ（１）」、「ニ（２）」、「サン（３）」、・・・、といった数字を裏コマンド名として設定することとしてもよい。この場合も、標準コマンド名よりも短い字数に裏コマンド名を設定することができるため、第１実施形態と同様の効果が得られるのは無論である。 In addition, the reverse command name is set to the initial letter when the standard command name is expressed in alphabets. For example, a command input screen 200 as shown in FIG. ”,“ D (2) ”,“ Sun (3) ”,... May be set as the back command name. Also in this case, since the reverse command name can be set to a shorter number of characters than the standard command name, it is needless to say that the same effect as in the first embodiment can be obtained.

〔第２実施形態〕
次に、転送装置１の第２実施形態について図６〜図７を参照して説明する。尚、第１実施形態における転送装置１と同一の構成要素には、同一の符号を伏してその詳細な説明は適宜省略する。 [Second Embodiment]
Next, a second embodiment of the transfer device 1 will be described with reference to FIGS. Note that the same components as those of the transfer device 1 in the first embodiment are denoted by the same reference numerals, and detailed description thereof is omitted as appropriate.

第２実施形態における転送装置１は、音声入力モードとして標準モードと選択モードとを有して構成される。選択モードは、標準モードにおいて入力文字列２４０と一致する標準コマンド名とがなかった場合に、当該入力文字列２４０と類似するコマンド名（以下、「類似コマンド名」という。）を一覧表示して、その類似コマンド名の中からの選択を可能としたモードである。 The transfer apparatus 1 in the second embodiment is configured to have a standard mode and a selection mode as voice input modes. In the selection mode, when there is no standard command name that matches the input character string 240 in the standard mode, a list of command names similar to the input character string 240 (hereinafter referred to as “similar command names”) is displayed. This mode enables selection from the similar command names.

図６（ａ）に、第２実施形態における記憶部２８のデータ構成の一例を示す。図６（ａ）によれば、記憶部２８は、入力文字列２４０と、コマンドテーブル２４４と、類似コマンドテーブル２４６とを記憶する。第２実施形態におけるコマンドテーブル２４４は、図６（ｂ）に示すように番号（ｎ）順に標準コマンド名を記憶するデータテーブルである。 FIG. 6A shows an example of the data configuration of the storage unit 28 in the second embodiment. According to FIG. 6A, the storage unit 28 stores an input character string 240, a command table 244, and a similar command table 246. The command table 244 in the second embodiment is a data table that stores standard command names in the order of number (n) as shown in FIG.

類似コマンドテーブル２４６は、入力文字列２４０に類似するコマンド名をコマンドテーブル２４４から抽出して蓄積的に記憶するデータテーブルであり、図６（ｃ）に示すように番号（ｍ）順に類似コマンド名を記憶する。 The similar command table 246 is a data table in which command names similar to the input character string 240 are extracted from the command table 244 and stored accumulatively, and similar command names are in order of number (m) as shown in FIG. Remember.

ＣＰＵ１０は、標準モードにおいて入力文字列２４０と一致する標準コマンド名がコマンドテーブル２４４に記憶されていなかった場合は、音声入力モードを選択モードに切り替える。そして、入力文字列２４０の先頭文字を取得し、この先頭文字から始まる標準コマンド名を類似コマンド名としてコマンドテーブル２４４から抽出して、類似コマンドテーブル２４６に蓄積記憶していく。ユーザは、類似コマンドテーブル２４６の番号（ｍ）を音声入力することで、所望の類似コマンド名を選択して、当該コマンド名の指示命令を転送装置１に実行させることができる。 When the standard command name that matches the input character string 240 is not stored in the command table 244 in the standard mode, the CPU 10 switches the voice input mode to the selection mode. Then, the first character of the input character string 240 is acquired, the standard command name starting from the first character is extracted from the command table 244 as a similar command name, and stored in the similar command table 246. The user can select a desired similar command name by voice input of the number (m) in the similar command table 246 and cause the transfer apparatus 1 to execute an instruction command of the command name.

次に、第２実施形態における転送装置１の具体的な動作について、図７のフローチャートを参照して説明する。尚、第１実施形態の図４のフローチャートと同一の処理内容には、同一のステップ番号を付してその説明を省略する。 Next, a specific operation of the transfer apparatus 1 in the second embodiment will be described with reference to the flowchart of FIG. In addition, the same processing number as the flowchart of FIG. 4 of 1st Embodiment attaches | subjects the same step number, and the description is abbreviate | omitted.

先ず、ＣＰＵ１０は、音声入力モードを標準モードとして、第１実施形態と同一のステップＡ３〜Ａ２１の処理を行うが、ステップＡ１３において、コマンド候補がコマンドテーブル２４４の最後尾であると判定した場合、即ち、入力文字列２４０と一致する標準コマンド名がコマンドテーブル２４４に記憶されていなかった場合（ステップＡ１３；Ｙｅｓ）、音声入力モードを標準モードから選択モードに切り替え、次の処理を行う。 First, the CPU 10 sets the voice input mode as the standard mode, and performs the same processes of steps A3 to A21 as in the first embodiment. However, if it is determined in step A13 that the command candidate is the tail of the command table 244, That is, if the standard command name that matches the input character string 240 is not stored in the command table 244 (step A13; Yes), the voice input mode is switched from the standard mode to the selection mode, and the following processing is performed.

具体的には、コマンドテーブル２４４内の先頭（ｎ＝１）の標準コマンド名をコマンド候補として選択し（ステップＢ２３）、入力文字列２４０の先頭文字と、コマンド候補の先頭文字とが一致するか否かを比較する（ステップＢ２７）。そして、比較の結果、コマンド候補と入力文字列２４０の先頭文字が一致した場合は（ステップＳ２７；Ｙｅｓ）、その標準コマンド名を類似コマンド名としてコマンドテーブル２４４に追加記憶する（ステップＢ２９）。 Specifically, the first (n = 1) standard command name in the command table 244 is selected as a command candidate (step B23), and the first character of the input character string 240 matches the first character of the command candidate. Whether or not is compared (step B27). If the command candidate matches the first character of the input character string 240 as a result of the comparison (step S27; Yes), the standard command name is additionally stored in the command table 244 as a similar command name (step B29).

また、ステップＢ２５の比較の結果、コマンド候補と入力文字列２４０の先頭文字が一致しなかった場合、ＣＰＵ１０は、コマンド候補として選択している標準コマンド名がコマンドテーブル２４４の最後尾であるか否かを判定する（ステップＢ３１）。ＣＰＵ１０は、最後尾ではないと判定した場合は（ステップＢ３１；Ｎｏ）、次（ｎ＝ｎ＋１）の標準コマンド名をコマンドテーブル２４４の中から選択して（ステップＢ３３）、ステップＢ２５の処理に移行する。 If the command candidate does not match the first character of the input character string 240 as a result of the comparison in step B25, the CPU 10 determines whether the standard command name selected as the command candidate is the tail of the command table 244. Is determined (step B31). If the CPU 10 determines that it is not the end (step B31; No), it selects the next (n = n + 1) standard command name from the command table 244 (step B33), and proceeds to the processing of step B25. To do.

例えば、音声認識の結果、「セイガ」という入力文字列２４０を取得したとする。このとき、図６（ｂ）のコマンドテーブル２４４に「セイガ」という標準コマンド名は記憶されていないため、「セ」で始まる標準コマンド名の「セイシガ」と「セッテイ」をコマンドテーブル２４４から抽出して図６（ｃ）のデータ構成の類似コマンドテーブル２４６に記憶する。このようにして、入力文字列２４０の先頭文字で始まる標準コマンド名、即ち類似する標準コマンド名を記憶した類似コマンドテーブル２４６が作成される。 For example, it is assumed that an input character string 240 “SEIGA” is acquired as a result of voice recognition. At this time, since the standard command name “SEIGA” is not stored in the command table 244 of FIG. 6B, the standard command names “SEISHIGA” and “SETTING” starting with “SE” are extracted from the command table 244. And stored in the similar command table 246 having the data structure shown in FIG. In this way, a similar command table 246 storing a standard command name starting with the first character of the input character string 240, that is, a similar standard command name is created.

ＣＰＵ１０は、ステップＢ３１において選択している標準コマンド名がコマンドテーブル２４４の最後尾であると判定した場合（ステップＢ３１；Ｙｅｓ）、記憶部２８から類似コマンドテーブル２４６を読み出して、図８（ａ）の類似コマンド選択画面２０２のように番号ｍと類似コマンド名とを表示部１９に一覧表示させる（ステップＢ３５）。 If the CPU 10 determines that the standard command name selected in step B31 is the tail of the command table 244 (step B31; Yes), the CPU 10 reads the similar command table 246 from the storage unit 28, and FIG. As in the similar command selection screen 202, the number m and the similar command names are displayed as a list on the display unit 19 (step B35).

そして、例えば、「該当するコマンドの番号を数字で選択して下さい」という表示メッセージにより、選択モードに切り替わった旨を表示部１９に表示させる（ステップＢ３７）。ＣＰＵ１０は、Ａ／Ｄ変換器１３からの音声データの入力を待機し、出力された音声データに音声認識処理を施し（ステップＢ３９）、その認識結果から取得した文字列を入力文字列２４０として記憶部２８に記憶する（ステップＢ４１）。 Then, for example, the display unit 19 displays that the mode has been switched to the selection mode by a display message “Please select the corresponding command number in numbers” (step B37). The CPU 10 waits for input of voice data from the A / D converter 13, performs voice recognition processing on the output voice data (step B 39), and stores a character string obtained from the recognition result as an input character string 240. The information is stored in the unit 28 (step B41).

そして、類似コマンドテーブル２４６の先頭番号（ｍ＝１）を選択し、この番号と入力文字列２４０とを比較する（ステップＢ４３）。ＣＰＵ１０は、ステップＢ４３における比較の結果、一致すると判定した場合は（ステップＢ４５；Ｙｅｓ）、ステップＡ１７に処理を移行し、その番号ｍに対応する類似コマンド名の指示命令を実行する（ステップＡ１７）。 Then, the head number (m = 1) of the similar command table 246 is selected, and this number is compared with the input character string 240 (step B43). As a result of the comparison in step B43, when it is determined that they match (step B45; Yes), the CPU 10 shifts the processing to step A17, and executes the instruction command of the similar command name corresponding to the number m (step A17). .

一方、ステップＢ４５において、一致しないと判定した場合は（ステップＢ４５；Ｎｏ）、次の番号（ｍ＝ｍ＋１）を順次選択して（ステップＢ４９）、入力文字列２４０と番号ｍとを比較する。ＣＰＵ１０は、この番号ｍの選択と比較を繰り返し、選択した番号ｍが類似コマンドテーブル２４６の最後尾であると判定した場合は（ステップＢ４７）、再度音声入力を依頼する旨を表示部１９に表示して（ステップＢ５１）、ステップＢ２３に処理を移行する。 On the other hand, if it is determined in step B45 that they do not match (step B45; No), the next number (m = m + 1) is sequentially selected (step B49), and the input character string 240 is compared with the number m. The CPU 10 repeats the selection and comparison of the number m, and when it is determined that the selected number m is the end of the similar command table 246 (step B47), the display unit 19 displays that the voice input is requested again. (Step B51), the process proceeds to Step B23.

例えば、図８（ａ）の類似コマンド選択画面２０２の表示後、音声認識の結果、「イチ」という入力文字列２４０を取得した場合、ＣＰＵ１０は、類似コマンドテーブル２４６において番号ｍ＝１に対応する類似コマンド名「セイシガ」の指示命令を実行する。具体的には、モダリティＭから静止画の画像データを取得する静止画モードに移行して、図８（ｂ）のようなコマンド入力画面２０４を表示し、静止画モードにおけるコマンド名の入力を待機する。 For example, after the display of the similar command selection screen 202 in FIG. 8A, when the input character string 240 “I” is acquired as a result of speech recognition, the CPU 10 corresponds to the number m = 1 in the similar command table 246. An instruction command with a similar command name “Seishiga” is executed. Specifically, the mode shifts to a still image mode for acquiring still image data from the modality M, displays a command input screen 204 as shown in FIG. 8B, and waits for input of a command name in the still image mode. To do.

そして、次に入力文字列２４０として「キ」を取得した場合、「キ」という標準コマンド名をコマンドテーブル２４４を記憶していないため、「キ」で始まる標準コマンド名を抽出して、図８（ｃ）の類似コマンド選択画面２０６を表示する。ユーザは、この表示されたコマンド名の中から、所望のコマンド名の番号を音声で選択することで、図８（ｄ）のように静止画の画像データの記録を開始させる。 Then, when “ki” is acquired as the input character string 240 next, since the standard command name “ki” is not stored in the command table 244, the standard command name starting with “ki” is extracted, and FIG. The similar command selection screen 206 in (c) is displayed. The user selects a desired command name number from the displayed command names by voice, and starts recording of still image data as shown in FIG. 8D.

以上、第２実施形態によれば、標準モードにおいて音声認識して取得した入力文字列２４０と標準コマンド名とが一致しなかった場合は、コマンドテーブル２４４の中から類似コマンド名を抽出して一覧表示する。そして、類似コマンド名と共に表示した番号がユーザにより音声入力された際に、その番号に対応する類似コマンド名の指示命令を実行する。 As described above, according to the second embodiment, when the input character string 240 acquired by voice recognition in the standard mode does not match the standard command name, the similar command names are extracted from the command table 244 and listed. indicate. When the number displayed together with the similar command name is inputted by voice by the user, the instruction command for the similar command name corresponding to the number is executed.

これにより、音声認識の認識率が低下した場合にも、ユーザの音声入力に類似するコマンド名の中から、番号という短い字数の音声入力でコマンド名を選択できる。このため、転送装置１に対する指示命令をより確実に行うことができると共に、外部環境の悪い中でも、転送装置１に対する指示命令の音声入力をより迅速に行うことができる。 Thereby, even when the recognition rate of voice recognition is lowered, a command name can be selected by voice input of a short number of characters called a number from command names similar to the user's voice input. For this reason, the instruction command for the transfer device 1 can be more reliably performed, and voice input of the instruction command for the transfer device 1 can be performed more promptly even in a poor external environment.

尚、上述した第２実施形態では、類似コマンド名と共に番号を表示することとしたが、例えば、“Ａ”、“Ｂ”、“Ｃ”といったアルファベット等の文字情報を類似コマンド名と共に表示し、アルファベットにより類似コマンド名を選択可能としても同様の効果が得られることは無論である。 In the second embodiment described above, the number is displayed together with the similar command name. For example, character information such as alphabet such as “A”, “B”, “C” is displayed together with the similar command name, It goes without saying that the same effect can be obtained even if similar command names can be selected by the alphabet.

また、類似コマンド名を入力文字列２４０の先頭文字と一致するか否かによって抽出することとしたが、例えば、入力文字列２４０の先頭と最後尾の文字それぞれと、コマンド名の先頭と最後尾の文字それぞれとが一致するコマンド名を類似コマンド名として抽出することとしてもよいし、入力文字列２４０の母音の配列と一致するコマンド名を類似コマンド名として抽出することとしてもよい。このように、類似コマンド名を抽出する条件設定を変更することで、ユーザの音声入力により近似するコマンド名を抽出することができるようになる。 Further, the similar command name is extracted depending on whether or not it matches the first character of the input character string 240. For example, the first and last characters of the input character string 240 and the first and last characters of the command name are extracted. A command name that matches each of the characters may be extracted as a similar command name, or a command name that matches the vowel array of the input character string 240 may be extracted as a similar command name. As described above, by changing the condition setting for extracting the similar command name, it is possible to extract a command name that approximates the voice input of the user.

また、標準コマンド名のユーザの使用頻度に応じて、類似コマンド名の表示順序を変更することとしてもよい。この場合、コマンドテーブル２４４の標準コマンド名に、その標準コマンド名がユーザにより選択された回数を記憶しておく。そして、抽出した類似コマンド名を一覧表示する際は、対応付けられた回数の降順に整列して表示する。これにより、ユーザは、使用頻度の高いコマンド名を容易に選択して音声入力することができるようになる。 The display order of similar command names may be changed according to the frequency of use of the standard command name by the user. In this case, the standard command name in the command table 244 stores the number of times the standard command name has been selected by the user. When the extracted similar command names are displayed as a list, they are displayed in the descending order of the associated number of times. As a result, the user can easily select and input a command name that is frequently used.

また、類似コマンドテーブル２４６として記憶した類似コマンド名のうち、転送装置１の処理状態に応じて実行可能な指示命令の類似コマンド名が一つだった場合は、その指示命令をそのまま実行することとしてもよい。具体的には、図８（ｄ）のコマンド入力画面２０８の表示時に音声入力が為されて、「テ」という入力文字列２４０を取得したとする。このとき、類似コマンド名として「テンソウ」と「テイシ」を抽出する。 In addition, if the similar command name stored as the similar command table 246 has one similar command name of the instruction command that can be executed according to the processing state of the transfer apparatus 1, the instruction command is executed as it is. Also good. Specifically, it is assumed that a voice input is made when the command input screen 208 of FIG. 8D is displayed, and an input character string 240 of “te” is acquired. At this time, “Tensou” and “Toshi” are extracted as similar command names.

そして、静止画の画像データの記録中に実行可能なコマンドが「テイシ」であった場合は、そのまま画像データの記録を停止して、図８（ｅ）のようなコマンド入力画面２１０を表示させる。これにより、より迅速に転送装置１に対する指示命令を行えるようになる。 If the command that can be executed during the recording of the still image data is “taste”, the recording of the image data is stopped as it is, and the command input screen 210 as shown in FIG. 8E is displayed. . As a result, an instruction command for the transfer apparatus 1 can be performed more quickly.

また、ユーザの音声入力を待機する際には、ＣＰＵ１０が直前に実行したコマンドと関連性の高いコマンド名を一覧表示することとしてもよい。図９は、コマンド名の関連性を階層化して示したツリー構造の一例である。 Further, when waiting for the user's voice input, a list of command names highly relevant to the command executed immediately before by the CPU 10 may be displayed. FIG. 9 shows an example of a tree structure in which the relevance of command names is shown in a hierarchy.

例えば、コマンド名「ドウガ（動画）」と関連性の高いコマンド名として、「キロク（記録）」、「サイセイ（再生）」、「テンソウ（転送）」、「ショウキョ（消去）」及び「キャンセル」があり、そのうちの「キロク」と関連性の高いコマンド名として「カイシ（開始）」と「キャンセル」とがある。これらの関連性を、図９のように予め階層化しておく。 For example, as command names that are highly related to the command name “DOGA (video)”, “KIROKU (record)”, “Saisei (playback)”, “TENSO (transfer)”, “Show (erase)”, and “Cancel” Among them, “Kishi (start)” and “Cancel” are command names highly relevant to “Kirok”. These relationships are hierarchized in advance as shown in FIG.

ＣＰＵ１０は、転送装置１の初期状態において、最上層のコマンド名を図１０（ａ）のコマンド入力画面２１２のように表示させる。そして、入力文字列２４０として「セ」を取得した場合には、最上層のコマンド名のうち、「セ」で始まるコマンド名、即ち、「セイシガ（静止画）」と「セッテイ（設定）」と抽出して、図１０（ｂ）のように表示させる。 In the initial state of the transfer apparatus 1, the CPU 10 displays the uppermost command name as in the command input screen 212 of FIG. When “set” is acquired as the input character string 240, command names beginning with “set” among the command names in the uppermost layer, that is, “seisiga (still image)” and “set (set)”. Extracted and displayed as shown in FIG.

そして、例えば、「イチ」という入力文字列２４０を取得した際には、図１０（ｃ）のように静止画モードに移行するように、その入力文字列２４０に基づいて音声入力により選択された番号に対応するコマンドを実行する。このとき、選択されたコマンド名「セイシガ」の下層のコマンド名である「キロク（記録）」、「サイセイ（再生）」、「テンソウ（転送）」、「ショウキョ（消去）」及び「キャンセル」を表示させて、次の音声入力を待機する。 For example, when the input character string 240 “Ichi” is acquired, the input character string 240 is selected based on the input character string 240 so as to shift to the still image mode as shown in FIG. Execute the command corresponding to the number. At this time, the command names below the selected command name “SEISHIGA” are “KIROK (record)”, “SEISI (play)”, “Tenso (transfer)”, “Show (erase)” and “Cancel”. Display and wait for the next voice input.

次いで、入力文字列２４０として「キ」を取得した際には、図１０（ｄ）のコマンド入力画面２１８のように「キ」で始まるコマンド名を表示する。ユーザは、このコマンド名の中から「キロク（記録）」を音声入力で選択すると、図１０（ｅ）のように更に下層のコマンド名「カイシ（開始）」及び「キャンセル」が表示される。 Next, when “ki” is acquired as the input character string 240, a command name starting with “ki” is displayed as in the command input screen 218 in FIG. When the user selects “Kirok (record)” from the command names by voice input, the command names “Kai (start)” and “Cancel” are displayed as shown in FIG. 10E.

ＣＰＵ１０は、入力文字列２４０として「カ」を取得すると、「カ」で始まるコマンド名の指示命令、即ち、静止画の画像データの記録を開始し、その下層の「テイシ（停止）」を図１０（ｆ）のように表示する。このとき、「テイシ（停止）」の下層にコマンド名はない。この場合、ＣＰＵ１０は、「テ」という入力文字列２４０を取得した際には、画像データの記録を停止すると共に、「テイシ（停止）」の上層のコマンド名「カイシ（開始）」及び「キャンセル」を図１０（ｇ）のように表示する。 When the CPU 10 obtains “K” as the input character string 240, the CPU 10 starts recording an instruction command with a command name starting with “K”, that is, image data of a still image, and displays a “taste (stop)” in the lower layer. 10 (f) is displayed. At this time, there is no command name in the lower layer of “taste (stop)”. In this case, when the input character string 240 “te” is acquired, the CPU 10 stops the recording of the image data, and at the same time the command names “start (stop)” and “cancel” of “taste (stop)”. "Is displayed as shown in FIG.

このように、ユーザが選択したコマンド名と関連例の高いコマンド名が表示されるため、ユーザは、表示部１９に表示されていくコマンド名の中から逐次選択することで、転送装置１の実際の使用に即した順序でコマンド名を選択してくことができる。 As described above, since the command name selected by the user and the command name having a high related example are displayed, the user can sequentially select the command name displayed on the display unit 19 so that the transfer device 1 can be actually used. You can select command names in the order in which they are used.

医用画像出力システムのシステム構成の一例を表すブロック図。The block diagram showing an example of the system configuration of a medical image output system. 転送装置の機能構成の一例を示すブロック図。The block diagram which shows an example of a function structure of a transfer apparatus. 第１実施形態における（ａ）は記憶部、（ｂ）はコマンドテーブルそれぞれのデータ構成の一例を示す図。(A) in a 1st embodiment is a storage part, (b) is a figure showing an example of each data composition of a command table. 第１実施形態における転送装置の具体的な動作を説明するためのフローチャート。6 is a flowchart for explaining a specific operation of the transfer apparatus according to the first embodiment. 第１実施形態における転送装置の表示画面の一例。An example of the display screen of the transfer apparatus in 1st Embodiment. 第２実施形態における（ａ）は記憶部、（ｂ）はコマンドテーブル、（ｃ）は類似コマンドテーブルそれぞれのデータ構成の一例を示す図。(A) is a memory | storage part in 2nd Embodiment, (b) is a command table, (c) is a figure which shows an example of each data structure of a similar command table. 第２実施形態における転送装置の具体的な動作を説明するためのフローチャート。9 is a flowchart for explaining specific operations of the transfer apparatus according to the second embodiment. 第２実施形態における転送装置の表示画面の一例。An example of the display screen of the transfer apparatus in 2nd Embodiment. コマンド名の関連性を階層化して示したツリー構造の一例。An example of a tree structure showing the relationship of command names in a hierarchy. 変形例における転送装置の表示画面の一例。An example of the display screen of the transfer apparatus in a modification.

Explanation of symbols

Ｓ医用画像出力システム
１転送装置
９出力装置
１１音声認識機能
１２音声入力部
１３Ａ／Ｄ変換器
１４アンプ
１５マイク
１６操作入力部
１９表示部
２２通信部
２４プログラムメモリ
２６画像メモリ
２８記憶部
２４０入力文字列
２４２コマンドテーブル
２４４コマンドテーブル
２４６類似コマンドテーブル S Medical Image Output System 1 Transfer Device 9 Output Device 11 Speech Recognition Function 12 Voice Input Unit 13 A / D Converter 14 Amplifier 15 Microphone 16 Operation Input Unit 19 Display Unit 22 Communication Unit 24 Program Memory 26 Image Memory 28 Storage Unit 240 Input Character string 242 Command table 244 Command table 246 Similar command table

Claims

Voice input means;
Voice recognition means for performing voice recognition of the voice input by the voice input means and converting it into character information;
Storage means for storing a command name for performing an instruction to execute a specific process relating to a medical image and a shortened command name having a shorter number of characters than the command name;
Comparing means for comparing the character information converted by the voice recognition means with the shortened command name stored in the storage means;
Command execution means for executing an instruction instruction of the command name associated with the compared shortened command name when the comparison result by the comparison means matches;
A command input device comprising:

And further comprising selection means for selecting one of the first and second voice input modes,
The comparison means includes
When the first voice input mode is selected by the selection means, the character information converted by the voice recognition means is compared with the command name,
2. The command input device according to claim 1, wherein when the second voice input mode is selected, the character information converted by the voice recognition means is compared with the abbreviated command name.

The selection means includes
3. The command input device according to claim 2, wherein one of the first and second voice input modes is selected based on the character information converted by the voice recognition means.

Voice input means;
Voice recognition means for performing voice recognition of the voice input by the voice input means and converting it into character information;
Storage means for storing a plurality of command names for instructing execution of specific processing relating to medical images;
Extracting means for extracting a command name similar to the character information converted by the voice recognition means from the command names stored in the storage means;
A list display unit that displays a list of identification character information having a shorter number of characters than the command name in association with each command extracted by the extraction unit;
Comparison means for comparing the character information converted by the voice recognition means and the identification character information displayed in the list;
Command execution means for executing an instruction command of the command name displayed in association with the compared identification character information when the comparison result by the comparison means matches;
A command input device comprising:

The extraction means includes
5. The command input device according to claim 4, wherein a command name that matches the head of the converted character information is extracted from command names stored in the storage means.

The list display means includes:
6. The command input device according to claim 4, further comprising a frequency order display means for displaying a list of command names extracted by the extraction means in descending order of use frequency of the user.

The list display means includes:
The apparatus further comprises a related order list display means for displaying a list of command names related to the command name of the instruction command executed immediately before by the command execution means among the command names extracted by the extraction means. Item 7. The command input device according to any one of Items 4 to 6.

The execution of the specific processing relating to the medical image includes at least one of transfer to an external device, writing to a storage unit, and display on a display unit. Command input device.

Computer
Speech recognition means for performing speech recognition of speech input by the speech input means and converting it into character information;
Storage means for storing a command name for performing an instruction to execute a specific process relating to a medical image and an abbreviated command name having fewer characters than the command name;
Comparison means for comparing the character information converted by the voice recognition means with the shortened command name stored in the storage means;
Command execution means for executing an instruction instruction of the command name associated with the compared short command name when the result of comparison by the comparison means matches;
Program to function as.

Computer
Speech recognition means for performing speech recognition of speech input by the speech input means and converting it into character information;
Storage means for storing a plurality of command names for instructing execution of specific processing relating to medical images;
Extraction means for extracting a command name similar to the character information converted by the voice recognition means from the command names stored in the storage means;
List display means for displaying a list in association with identification character information for each command name extracted by the extraction means;
Comparison means for comparing the character information converted by the voice recognition means with the identification character information displayed in the list;
Command execution means for executing an instruction command of the command name displayed in association with the compared identification character information when the result of comparison by the comparison means matches;
Program to function as.