JP2001060149A

JP2001060149A - Document preparing device, and recording medium recorded with document preparation program

Info

Publication number: JP2001060149A
Application number: JP23431999A
Authority: JP
Inventors: 秀享 ▲高▼橋; Hideyuki Takahashi
Original assignee: Olympus Optical Co Ltd
Current assignee: Olympus Corp
Priority date: 1999-08-20
Filing date: 1999-08-20
Publication date: 2001-03-06

Abstract

PROBLEM TO BE SOLVED: To provide a document preparing device and a recording medium in which a document preparing processing program is recorded, capable of applying a recognizing processing to arbitrary speech data and displaying the data as character information in a prescribed area, and excellent in operability. SOLUTION: At the time of drag 1 drop to a patient name input area 23 of a medical certificate preparation window 102 by operating a mouse and selecting an audio data file 'file 1' in which 'patient name' is recorded (S4), the date/time of consultation or name of doctor recorded in the header area of the audio data file 'file 1' is read out and displayed in correspondent display areas 21 and 22 of the relevant window 102 and further, the recognized result is displayed in a patient name input area 23 by applying voice recognizing processing to the audio data of the audio data file 'file 1'. The progress from the symptoms of a disease to the first consultation, remarks and passage are similarly processed as well.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文書作成装置、詳
しくは、音声データに音声認識処理を施して文書作成を
行う文書作成装置及び文書作成処理プログラムを記録し
た記録媒体に関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document creation apparatus, and more particularly, to a document creation apparatus that performs speech recognition processing on voice data to create a document, and a recording medium storing a document creation processing program.

【０００２】[0002]

【従来の技術】従来、医師の診断書等の定型文章を、例
えばワードプロセッサ等を用いて作成する場合、作成を
容易に行うため予め所定の定型文章を含むフォーマット
を用意して、係るフォーマット中の所定箇所に任意のテ
キスト形式による文章入力領域を設定する手法は、広く
知られるところである。2. Description of the Related Art Conventionally, when a fixed form such as a medical certificate of a doctor is prepared using, for example, a word processor or the like, a format including a predetermined fixed form is prepared in advance to facilitate the preparation. A method of setting a text input area in an arbitrary text format at a predetermined location is widely known.

【０００３】また、近年、音声データをパーソナルコン
ピューター等で取り込み、音声認識ソフトウェアを用い
て音声認識を行う音声認識システムも知られるところに
ある。さらに、このような音声認識システムを用い、マ
イクロフォンから入力される音声を自動的にテキスト形
式の文章に変換し、モニタ画面上に当該テキスト文章を
表示する技術手段も知られている。そして、このような
音声認識技術を用いた文章作成手法を先に述べた定型文
章のフォーマット中のテキスト文章入力に利用する手法
も種々提案されている。[0003] In recent years, there is also known a voice recognition system that captures voice data with a personal computer or the like and performs voice recognition using voice recognition software. Further, there is also known a technical means for automatically converting a voice input from a microphone into a text in a text format by using such a voice recognition system and displaying the text on a monitor screen. Various techniques have been proposed for utilizing such a text creation technique using a speech recognition technique for inputting a text text in the above-described fixed text format.

【０００４】なお、このような音声認識技術を用いたテ
キスト文章入力技術においては、ユーザーがキーボード
やマウス等によりカーソルを所定のテキスト文章入力領
域に移動した後に音声入力を行うことが一般的である。In the text text input technology using such a voice recognition technology, it is common that a user inputs a voice after moving a cursor to a predetermined text text input area by using a keyboard, a mouse, or the like. .

【０００５】一方、近年、音声等の音源をデジタル信号
に変換して記録する、いわゆるデジタル音声録音装置が
種々提案されており、本出願人は先に特開平１０−３４
０１８０号公報において、このようなデジタル音声録音
装置で一旦録音した音声データをパーソナルコンピュー
ター上で簡単な操作で取り扱うことを可能とする音声デ
ータの処理制御装置を提案している。On the other hand, in recent years, there have been proposed various so-called digital voice recording apparatuses for converting a sound source such as voice into a digital signal and recording the digital signal.
Japanese Patent Application Publication No. 0180 proposes an audio data processing control device that enables audio data once recorded by such a digital audio recording device to be handled by a simple operation on a personal computer.

【０００６】さらに、本出願人は先に特開平１０−３４
０１７９号公報において、音声データを上記音声データ
の処理制御装置から音声認識装置に渡して音声処理を行
い、文章として画面に表示させるディクテーションシス
テムを提案している。[0006] Further, the present applicant has previously disclosed in Japanese Patent Application Laid-Open No. 10-34.
No. 0179 proposes a dictation system in which voice data is passed from the voice data processing control device to a voice recognition device to perform voice processing and display the text on a screen.

【０００７】このようなディクテーションシステムによ
れば、パーソナルコンピュータ等に直接音声入力をせず
とも、一旦音声記録装置に音声を録音しておき、後にパ
ーソナルコンピュータ等に当該録音した音声データを転
送して音声認識、文章作成等を行うことが可能となる。According to such a dictation system, without directly inputting a voice to a personal computer or the like, the voice is temporarily recorded in a voice recording device, and then the recorded voice data is transferred to a personal computer or the like. It is possible to perform voice recognition, text creation, and the like.

【０００８】例えば、医師がＸ線画像を見ながら診断書
を作成する場合、いちいちキーボード等の入力装置を操
作するのは煩わしいものであるが、画像を見ながら所見
内容を上述した如き音声録音装置に入力しておき、後に
音声認識システムとして機能するパーソナルコンピュー
タ等に録音した音声データを転送して音声認識、文章作
成等を行う。For example, when a doctor prepares a medical certificate while looking at an X-ray image, it is troublesome to operate an input device such as a keyboard each time. Then, the recorded voice data is transferred to a personal computer or the like which functions as a voice recognition system later to perform voice recognition and text creation.

【０００９】[0009]

【発明が解決しようとする課題】上述した本出願人によ
る音声データの処理制御装置、ディクテーションシステ
ムは非常に有用なものであるが、これらの技術では、音
声認識専用のウィンドウで指定した録音データの全てを
一旦、音声認識によりテキスト文章化した後、キーボー
ドやマウス等を用いて当該テキスト文章中、所望の任意
部分を指定して定型文章フォーマット上の所定領域にコ
ピー（カット）アンドペーストしていたので、必ずしも
使い勝手がいいとは言えなかった。Although the processing control apparatus and dictation system for voice data by the present applicant described above are very useful, in these techniques, the recording data specified by a window dedicated to voice recognition is used. All of the text was once converted into a text by speech recognition, and then a desired arbitrary portion was designated in the text using a keyboard, a mouse, or the like, and was copied (cut) and pasted to a predetermined area in a fixed text format. Therefore, it was not always easy to use.

【００１０】すなわち、上記従来の技術では、まず音声
記録装置から転送された録音データのうち所望の範囲を
指定して、その後、定型文章フォーマット上の所定領域
を選択すると共に当該領域において指定した録音データ
に対して音声認識処理を施し、且つテキスト文章化する
ことはできなかった。[0010] That is, in the above-mentioned conventional technique, first, a desired range of the recording data transferred from the voice recording device is specified, and then a predetermined area in a fixed text format is selected, and the recording specified in the area is selected. The data could not be subjected to speech recognition processing and converted into text.

【００１１】本発明はかかる事情に鑑みてなされたもの
であり、任意の音声データに音声認識処理を施すと共に
所定の領域に文字情報として表示でき、使い勝手の良い
文書作成装置及び文書作成処理プログラムを記録した記
録媒体を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and provides an easy-to-use document creation apparatus and a document creation processing program which can perform speech recognition processing on arbitrary voice data and display it as character information in a predetermined area. It is an object of the present invention to provide a recorded recording medium.

【００１２】[0012]

【課題を解決するための手段】上記の目的を達成するた
めに本発明の第１の文書作成装置は、プログラムされた
コンピュータによって文書作成処理をする文書作成装置
であって、少なくとも所定のテキスト入力領域の集合を
含む文書編集用ウィンドウを表示する文書編集用ウィン
ドウ表示手段と、音声データファイルに係るアイコンを
含む音声データファイル表示ウィンドウを表示する音声
データファイル表示ウィンドウ表示手段と、少なくとも
音声データファイルの選択を行うポインティング・デバ
イスにより、上記音声データファイル表示ウィンドウに
表示された音声データファイルに係るアイコンのうち任
意のアイコンが選択され上記文書編集用ウィンドウに表
示された任意のテキスト入力領域にドラッグアンドドロ
ップ操作された際、当該選択されたアイコンに対応する
音声データファイルに係る音声データに音声認識処理を
施す音声認識処理手段と、上記音声認識処理手段により
変換されたテキストを上記テキスト入力領域に表示する
音声認識結果表示手段と、を備えたことを特徴とする。In order to achieve the above object, a first document creating apparatus of the present invention is a document creating apparatus for performing a document creating process by a programmed computer, and includes at least a predetermined text input. A document editing window display for displaying a document editing window including a set of regions; an audio data file display window displaying for displaying an audio data file display window including an icon related to the audio data file; By the pointing device for selection, any icon among the icons related to the audio data file displayed in the audio data file display window is selected and dragged and dropped to an arbitrary text input area displayed in the document editing window. When operated Voice recognition processing means for performing voice recognition processing on voice data relating to the voice data file corresponding to the selected icon, and voice recognition result display means for displaying the text converted by the voice recognition processing means in the text input area And characterized in that:

【００１３】上記の目的を達成するために本発明の第２
の文書作成装置は、プログラムされたコンピュータによ
って文書作成処理をする文書作成装置であって、少なく
とも所定のテキスト入力領域の集合を含む文書編集用ウ
ィンドウを表示する文書編集用ウィンドウ表示手段と、
音声データファイルとして記録された音声の時間軸波形
を表示する音声データ波形ウィンドウを表示する音声デ
ータ波形ウィンドウ表示手段と、少なくとも音声データ
ファイルの選択を行うポインティング・デバイスによ
り、上記音声データ波形ウィンドウに表示された音声の
時間軸波形のうち任意の波形領域が選択され上記文書編
集用ウィンドウに表示された任意のテキスト入力領域に
ドラッグアンドドロップ操作された際、当該選択された
波形領域に対応する音声データに音声認識処理を施す音
声認識処理手段と、上記音声認識処理手段により変換さ
れたテキストを上記テキスト入力領域に表示する音声認
識結果表示手段と、を備えたことを特徴とする。[0013] To achieve the above object, a second aspect of the present invention is provided.
The document creation device is a document creation device that performs a document creation process by a programmed computer, document display window display means for displaying a document edit window including at least a predetermined set of text input area,
An audio data waveform window displaying means for displaying an audio data waveform window for displaying a time axis waveform of audio recorded as an audio data file, and at least a pointing device for selecting an audio data file to display in the audio data waveform window. When an arbitrary waveform area in the time axis waveform of the selected audio is selected and dragged and dropped on an arbitrary text input area displayed in the document editing window, audio data corresponding to the selected waveform area And voice recognition result display means for displaying the text converted by the voice recognition processing means in the text input area.

【００１４】上記の目的を達成するために本発明の第３
の文書作成装置は、プログラムされたコンピュータによ
って文書作成処理をする文書作成装置であって、少なく
とも所定のテキスト入力領域の集合を含む文書編集用ウ
ィンドウを表示する文書編集用ウィンドウ表示手段と、
音声データファイルに付与したインデックス情報を含む
音声データファイル表示ウィンドウを表示する音声デー
タファイル表示ウィンドウ表示手段と、少なくとも音声
データファイルの選択を行うポインティング・デバイス
により、上記音声データファイル表示ウィンドウに表示
されたインデックス情報に基づいて任意の音声データ領
域が選択され上記文書編集用ウィンドウに表示された任
意のテキスト入力領域にドラッグアンドドロップ操作さ
れた際、当該選択された音声データ領域に対応する音声
データに音声認識処理を施す音声認識処理手段と、上記
音声認識処理手段により変換されたテキストを上記テキ
スト入力領域に表示する音声認識結果表示手段と、を備
えたことを特徴とする。[0014] To achieve the above object, a third aspect of the present invention is provided.
The document creation device is a document creation device that performs a document creation process by a programmed computer, document display window display means for displaying a document edit window including at least a predetermined set of text input area,
The audio data file display window displaying means for displaying the audio data file display window including the index information assigned to the audio data file and the pointing device for selecting at least the audio data file are displayed in the audio data file display window. When an arbitrary audio data area is selected based on the index information and drag-and-drop operation is performed on an arbitrary text input area displayed in the document editing window, an audio data corresponding to the selected audio data area is output. It is characterized by comprising speech recognition processing means for performing recognition processing, and speech recognition result display means for displaying the text converted by the speech recognition processing means in the text input area.

【００１５】上記の目的を達成するために本発明の第１
の文書作成処理プログラムを記録した記録媒体は、コン
ピュータによって文書作成処理をするためのプログラム
を記録した記録媒体であって、少なくとも所定のテキス
ト入力領域の集合を含む文書編集用ウィンドウを表示す
る文書編集用ウィンドウ表示機能と、音声データファイ
ルに係るアイコンを含む音声データファイル表示ウィン
ドウを表示する音声データファイル表示ウィンドウ表示
機能と、少なくとも音声データファイルの選択を行うポ
インティング・デバイスにより、上記音声データファイ
ル表示ウィンドウに表示された音声データファイルに係
るアイコンのうち任意のアイコンが選択され上記文書編
集用ウィンドウに表示された任意のテキスト入力領域に
ドラッグアンドドロップ操作された際、当該選択された
アイコンに対応する音声データファイルに係る音声デー
タに音声認識処理を施す音声認識処理機能と、上記音声
認識処理機能により変換されたテキストを上記テキスト
入力領域に表示させる音声認識結果表示機能と、を実現
させるためのプログラムを記録する。[0015] To achieve the above object, the first aspect of the present invention is as follows.
Is a recording medium on which a program for performing a document creation process by a computer is recorded, and a document editing window for displaying a document editing window including at least a predetermined set of text input areas. Window display function for displaying an audio data file display window including an icon relating to the audio data file, and a pointing device for selecting at least the audio data file. When an arbitrary icon is selected from the icons related to the audio data file displayed in and is dragged and dropped to an arbitrary text input area displayed in the document editing window, the icon corresponding to the selected icon is selected. A program for realizing a voice recognition processing function of performing voice recognition processing on voice data related to a voice data file, and a voice recognition result display function of displaying text converted by the voice recognition processing function in the text input area. Record

【００１６】上記の目的を達成するために本発明の第２
の文書作成処理プログラムを記録した記録媒体は、コン
ピュータによって文書作成処理をするためのプログラム
を記録した記録媒体であって、少なくとも所定のテキス
ト入力領域の集合を含む文書編集用ウィンドウを表示す
る文書編集用ウィンドウ表示機能と、音声データファイ
ルとして記録された音声の時間軸波形を表示する音声デ
ータ波形ウィンドウを表示する音声データ波形ウィンド
ウ表示機能と、少なくとも音声データファイルの選択を
行うポインティング・デバイスにより、上記音声データ
波形ウィンドウに表示された音声の時間軸波形のうち任
意の波形領域が選択され上記文書編集用ウィンドウに表
示された任意のテキスト入力領域にドラッグアンドドロ
ップ操作された際、当該選択された波形領域に対応する
音声データに音声認識処理を施す音声認識処理機能と、
上記音声認識処理機能により変換されたテキストを上記
テキスト入力領域に表示させる音声認識結果表示機能
と、を実現させるためのプログラムを記録する。In order to achieve the above object, the second aspect of the present invention
Is a recording medium on which a program for performing a document creation process by a computer is recorded, and a document editing window for displaying a document editing window including at least a predetermined set of text input areas. Window display function for displaying a time axis waveform of a voice recorded as a voice data file, a voice data waveform window display function for displaying a voice data waveform window, and a pointing device for selecting at least a voice data file. When an arbitrary waveform area is selected from the time axis waveform of the audio displayed in the audio data waveform window and dragged and dropped to an arbitrary text input area displayed in the document editing window, the selected waveform is displayed. Audio to audio data corresponding to the area And voice recognition processing function of performing the identification process,
A program for realizing a speech recognition result display function for displaying the text converted by the speech recognition processing function in the text input area is recorded.

【００１７】上記の目的を達成するために本発明の第３
の文書作成処理プログラムを記録した記録媒体は、コン
ピュータによって文書作成処理をするためのプログラム
を記録した記録媒体であって、少なくとも所定のテキス
ト入力領域の集合を含む文書編集用ウィンドウを表示す
る文書編集用ウィンドウ表示機能と、音声データファイ
ルに付与したインデックス情報を含む音声データファイ
ル表示ウィンドウを表示する音声データファイル表示ウ
ィンドウ表示機能と、少なくとも音声データファイルの
選択を行うポインティング・デバイスにより、上記音声
データファイル表示ウィンドウに表示されたインデック
ス情報に基づいて任意の音声データ領域が選択され上記
文書編集用ウィンドウに表示された任意のテキスト入力
領域にドラッグアンドドロップ操作された際、当該選択
された音声データ領域に対応する音声データに音声認識
処理を施す音声認識処理機能と、上記音声認識処理機能
により変換されたテキストを上記テキスト入力領域に表
示する音声認識結果表示機能と、を実現させるためのプ
ログラムを記録する。In order to achieve the above object, the third aspect of the present invention
Is a recording medium on which a program for performing a document creation process by a computer is recorded, and a document editing window for displaying a document editing window including at least a predetermined set of text input areas. Window display function for displaying an audio data file display window including index information assigned to the audio data file, and a pointing device for selecting at least the audio data file. When an arbitrary audio data area is selected based on the index information displayed on the display window and dragged and dropped on an arbitrary text input area displayed on the document editing window, the selected audio data area is displayed. A program for realizing a voice recognition processing function of performing voice recognition processing on voice data corresponding to a region, and a voice recognition result display function of displaying text converted by the voice recognition processing function in the text input area. Record.

【００１８】[0018]

【発明の実施の形態】以下、図面を参照して本発明の実
施の形態を説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１９】図１は、本発明の第１の実施形態である文
書作成装置の概念的な全体構成を示した説明図である。FIG. 1 is an explanatory diagram showing a conceptual overall configuration of a document creating apparatus according to a first embodiment of the present invention.

【００２０】この文書作成装置は、図１に示すように、
音声を電気信号に変換して音声データ化するディジタル
レコーダ１と、このディジタルレコーダ１に着脱可能に
装着して用いられるものであって上記音声データを記録
する記録媒体たるミニチュアカード２と、このミニチュ
アカード２を図示しないＰＣカードスロットに挿入して
接続可能とするためのＰＣカードアダプタ３と、テキス
ト文章等を表示するディスプレイ５やキーボード６，マ
ウス７等を備え、ＰＣカードスロットを介して上記ミニ
チュアカード２から得た音声データに、制御プログラム
８、音声認識プログラム９あるいは文章作成プログラム
１０による処理を施す音声処理装置としてのパーソナル
コンピュータ４とを有して構成されている。This document creation device, as shown in FIG.
A digital recorder 1 for converting a sound into an electric signal and converting it into sound data; a miniature card 2 which is detachably mounted on the digital recorder 1 and used as a recording medium for recording the sound data; A PC card adapter 3 for inserting the card 2 into a PC card slot (not shown) to enable connection; a display 5 for displaying text and the like; a keyboard 6; a mouse 7; And a personal computer 4 as a voice processing device for performing processing by the control program 8, the voice recognition program 9 or the text creation program 10 on the voice data obtained from the computer 2.

【００２１】上記ディジタルレコーダ１は録音手段とし
て使用されるものであり、図示しないマイクロホンで入
力した音声をディジタル信号に変換して、記録媒体に音
声データファイルとして記録する。この記録媒体として
は例えば、ミニチュアカードとよばれる、ディジタルレ
コーダ１に着脱自在なフラッシュメモリカードが使用さ
れる。このミニチュアカード２が、例えばＰＣカードア
ダプタ３を介してパーソナルコンピュータ４に装填され
ることにより、ミニチュアカード２に記録された音声デ
ータファイルがパーソナルコンピュータ４内のハードデ
ィスク（図示せず）に転送されるようになっている。The digital recorder 1 is used as a recording means, and converts a sound inputted by a microphone (not shown) into a digital signal and records it as a sound data file on a recording medium. As this recording medium, for example, a flash memory card called a miniature card which is detachable from the digital recorder 1 is used. When the miniature card 2 is inserted into the personal computer 4 via, for example, the PC card adapter 3, the audio data file recorded on the miniature card 2 is transferred to a hard disk (not shown) in the personal computer 4. It has become.

【００２２】パーソナルコンピュータ４は、本体キーボ
ード６と、マウス７等のいわゆるポインティングデバイ
ス（入力デバイス）と、テキスト文章等の表示デバイス
であるディスプレイ５とがそれぞれ接続されて構成され
ている。そして、このパーソナルコンピュータ４のハー
ドディスク（図示せず）には、転送された音声データフ
ァイルに関する情報の表示や音声データファイルの再生
を行う制御プログラム８と、選択された音声データファ
イルを自動的に音声認識した後にディスプレイ５にテキ
スト表示することを可能とする音声認識プログラム９
と、定型文書フォーマットの所定位置にテキスト入力領
域が設定されている文書作成プログラム１０とが格納さ
れている。なお、上記制御プログラム８、音声認識プロ
グラム９、文章作成プログラム１０は、文書作成処理プ
ログラムを構成する。The personal computer 4 is constituted by connecting a main body keyboard 6, a so-called pointing device (input device) such as a mouse 7, and a display 5 which is a display device for displaying text and the like. The hard disk (not shown) of the personal computer 4 has a control program 8 for displaying information on the transferred audio data file and reproducing the audio data file, and automatically outputs the selected audio data file to the audio data file. Speech recognition program 9 capable of displaying text on display 5 after recognition
And a document creation program 10 in which a text input area is set at a predetermined position in the standard document format. The control program 8, the speech recognition program 9, and the text creation program 10 constitute a document creation processing program.

【００２３】またこれら文書作成処理プログラムは、当
該パーソナルコンピュータ４の出荷・販売時に予めハー
ドディスクＨＤＤ（この場合、パーソナルコンピュータ
４に予め内蔵されるハードディスクを想定する）に格納
されている必要はない。すなわち、当該パーソナルコン
ピュータ４に接続可能な外部記録媒体、例えば、ＣＤ−
ＲＯＭ，ＭＯ，ＦＤ等にこれら制御プログラム８、音声
認識プログラム９、文章作成プログラム１０を記録して
おき、適宜当該パーソナルコンピュータ４のハードディ
スクにインストゥールするようにしてもよい。These document creation processing programs do not need to be stored in the hard disk HDD (in this case, a hard disk built in the personal computer 4 in advance) when the personal computer 4 is shipped and sold. That is, an external recording medium connectable to the personal computer 4, for example, a CD-
The control program 8, the voice recognition program 9, and the text creation program 10 may be recorded in a ROM, MO, FD, or the like, and may be appropriately installed on the hard disk of the personal computer 4.

【００２４】さらに、上記制御プログラム８、音声認識
プログラム９、文章作成プログラム１０は、上記ＣＤ−
ＲＯＭ等の記録媒体よりパーソナルコンピュータ４のハ
ードディスクに転送されるのみならず、通信回線（有
線、無線を問わない）を介して外部の記録装置等から当
該パーソナルコンピュータ４に取り込むことも可能であ
る。Further, the control program 8, the voice recognition program 9, and the text creation program 10 are provided in the CD-ROM.
Not only can the data be transferred from a recording medium such as a ROM to the hard disk of the personal computer 4, but also can be taken into the personal computer 4 from an external recording device or the like via a communication line (wired or wireless).

【００２５】すなわち上記制御プログラム８、音声認識
プログラム９、文章作成プログラム１０が記録された媒
体（通信回線を含む）は、パーソナルコンピュータ４本
体とは別に流通・販売経路に乗ることが可能であり、本
パーソナルコンピュータ４に転送、実行されることで文
書作成装置を構成する。That is, a medium (including a communication line) on which the control program 8, the voice recognition program 9, and the text creation program 10 are recorded can be put on a distribution / sales channel separately from the personal computer 4 itself. The document is created by being transferred to the personal computer 4 and executed.

【００２６】なお、本第１の実施形態では、着脱自在な
フラッシュメモリカード（ミニチュアカード２）に記録
された音声データファイルはＰＣカードアダプタ３を介
して転送されるようにしたが、ディジタルレコーダ１と
パーソナルコンピュータ４とを直接接続して音声データ
ファイルを転送するようにしてもよいし、無線によるデ
ータ通信により転送するようにしてもよい。In the first embodiment, the audio data file recorded on the removable flash memory card (miniature card 2) is transferred via the PC card adapter 3. The audio data file may be transferred by directly connecting to the personal computer 4, or may be transferred by wireless data communication.

【００２７】また、ポインティングデバイスとしては、
マウス７の他にトラックボール、タブレット、ジョイス
ティック、ライトペン等の入力デバイスが想定される。
また、キーボード６上のキー自体でこれら入力デバイス
と同様の役目を果たすことも可能である。As a pointing device,
In addition to the mouse 7, input devices such as a trackball, a tablet, a joystick, and a light pen are assumed.
Further, the keys themselves on the keyboard 6 can also serve the same role as these input devices.

【００２８】次に、本第１の実施形態の文書作成装置を
医療診断書の作成を目的とする装置として用いた例につ
いて説明する。なお、この例では、上記音声認識プログ
ラム９、文書作成プログラム１０として、診断書作成プ
ログラムが使用される。Next, a description will be given of an example in which the document creating apparatus according to the first embodiment is used as an apparatus for creating a medical certificate. In this example, a medical certificate creation program is used as the speech recognition program 9 and the document creation program 10.

【００２９】図２は、上記制御プログラム８、診断書作
成プログラム（音声認識プログラム９、文章作成プログ
ラム１０）を起動したときのディスプレイ５の一表示例
を示した説明図である。FIG. 2 is an explanatory diagram showing one display example of the display 5 when the control program 8 and the diagnostic certificate creation program (the speech recognition program 9 and the sentence creation program 10) are activated.

【００３０】図２に示すように、このときディスプレイ
５には音声データファイル表示ウィンドウである制御ウ
ィンドウ１０１と文書編集用ウィンドウである診断書作
成ウィンドウ１０２とが表示される。As shown in FIG. 2, at this time, the display 5 displays a control window 101 as a voice data file display window and a medical certificate creation window 102 as a document editing window.

【００３１】このうち、上記制御ウィンドウ１０１は、
いわゆるメニューバー１１、ツールバー１２、ファイル
一覧表示部１３、再生表示部１４、再生制御部１５等を
有して構成される。上記ファイル一覧表示部１３は、複
数種の音声データファイルを一覧として表示する部分で
あり、各音声データファイル毎に対応するアイコン１３
ａが対応付られ、表示される。The control window 101 includes:
It has a so-called menu bar 11, a toolbar 12, a file list display section 13, a reproduction display section 14, a reproduction control section 15, and the like. The file list display section 13 is a section for displaying a plurality of types of audio data files as a list, and an icon 13 corresponding to each audio data file.
a is correlated and displayed.

【００３２】なお、本第１の実施形態では、扱う音声デ
ータファイルとして「患者氏名」、「疾病名」、「発病
から初診までの経過」、「所見および経過」等、予め音
声記録装置等に録音された音声データファイルを想定す
る。これら音声データは、それぞれｆｉｌｅ１、ｆｉｌ
ｅ２、ｆｉｌｅ３、ｆｉｌｅ４として記録され、図２に
示すように各ｆｉｌｅＸには、音声データに係るファイ
ルであることを示す所定のアイコン１３ａが対応付けら
れている。In the first embodiment, the voice data files to be handled, such as "patient name", "disease name", "the progress from onset to the first consultation", "findings and progress", are stored in advance in a voice recording device or the like. Assume a recorded audio data file. These audio data are file1, file, respectively.
The file X is recorded as e2, file3, and file4. As shown in FIG. 2, each fileX is associated with a predetermined icon 13a indicating a file related to audio data.

【００３３】一方、診断書作成ウィンドウ１０２は、
「診断日時」、「医師氏名」、「患者氏名」、「疾病
名」、「発病から初診までの経過」、「所見および経
過」の文字列表示と、上記「診断日時」、「医師氏名」
の文字列表示に対応した診断日時表示領域２１，医師氏
名表示領域２２と、上記「患者氏名」、「疾病名」、
「発病から初診までの経過」、「所見および経過」の文
字列表示にそれぞれ対応したテキスト文字入力領域であ
る、患者氏名入力領域２３，疾病名入力領域２４，発病
から初診までの経過入力領域２５，所見および経過入力
領域２６が設定されている。On the other hand, the medical certificate creation window 102
"Diagnosis date and time", "physician name", "patient name", "disease name", "elapsed time from onset to first consultation", "findings and progress" character string display, and the above "diagnosis date and time", "doctor name"
The diagnosis date / time display area 21 and the doctor name display area 22 corresponding to the character string display, and the “patient name”, “disease name”,
A patient name input area 23, a disease name input area 24, and a progress input area 25 from disease onset to first consultation, which are text character input areas respectively corresponding to the character string display of "process from onset to first consultation" and "findings and progress". , Findings and progress input area 26 are set.

【００３４】なお、上記テキスト文字入力領域である、
患者氏名入力領域２３，疾病名入力領域２４，発病から
初診までの経過入力領域２５，所見および経過入力領域
２６は、任意の領域をマウス７等のポインティングデバ
イスで指定した後、キーボード６上からテキスト文字を
入力したり、後述するように音声認識処理の結果を表示
させることが可能となっている。The text character input area is
The patient name input area 23, the disease name input area 24, the progress input area 25 from the onset of the disease to the first consultation, and the findings and progress input area 26 are designated by using a pointing device such as the mouse 7 and then text is input from the keyboard 6. It is possible to input characters and display the result of voice recognition processing as described later.

【００３５】次に、当該医療診断書作成装置において、
実際に診断書を作成する手法を図３に示すフローチャー
トを参照して説明する。なお、同図において、左側に処
理フローとして示す各ステップ（ステップＳ１〜Ｓ７）
は使用者の操作を示すものであり、これら各ステップと
太線で結ばれた右側の各処理は、上記ステップＳ１〜Ｓ
７に対応するパーソナルコンピュータ４（内設するＣＰ
Ｕ等の制御部）の処理動作を示すものである。なお、以
下、パーソナルコンピュータ４の動作としてはこのＣＰ
Ｕの動作を示すものとする。Next, in the medical certificate creating apparatus,
A method of actually creating a medical certificate will be described with reference to a flowchart shown in FIG. It should be noted that in the figure, each step shown as a processing flow on the left side (steps S1 to S7)
Indicates the operation of the user, and the respective processes on the right side, which are connected to these steps by bold lines, are performed in steps S1 to S
7 corresponding to the personal computer 4 (internal CP
U, etc.). Hereinafter, the operation of the personal computer 4 is referred to as the CP.
The operation of U is shown.

【００３６】まず、使用者は、診断書作成プログラムの
入力事項をディジタルレコーダ１にファイルに分けて録
音する。すなわち、音声データファイルｆｉｌｅ１とし
て「患者氏名」、同ｆｉｌｅ２として「疾病名」、同ｆ
ｉｌｅ３として「発病から初診までの経過」、同ｆｉｌ
ｅ４として「所見および経過」をそれぞれ録音する（ス
テップＳ１）。First, the user records the input items of the medical certificate creation program in the digital recorder 1 in the form of files. In other words, the "patient name" as the voice data file file1, the "disease name" as the file2, and the f
ile3 "The progress from onset to first consultation", same file
"Findings and progress" are recorded as e4 (step S1).

【００３７】図４は、上記各ファイルの音声メモリの記
録領域を示した説明図である。図に示すように、音声メ
モリの記録領域はインデックス情報等のヘッダー領域３
１と音声データ領域３２とに分けられる。FIG. 4 is an explanatory diagram showing a recording area of the audio memory of each file. As shown in the figure, the recording area of the audio memory is a header area 3 such as index information.
1 and an audio data area 32.

【００３８】なお、ヘッダー領域３１には、本実施形態
では、ユーザ情報、録音開始日時、録音終了日時、イン
デックスマーク（Ｉマーク）アドレス（本実施形態にお
いては、１番〜１５番）が記録されるようになってい
る。すなわち、このヘッダー領域３１には、各ファイル
毎に診断日時（録音開始日時、録音終了日時）や、医師
氏名が記録されるようになっている。In the present embodiment, the user information, the recording start date and time, the recording end date and time, and the index mark (I mark) address (No. 1 to No. 15 in this embodiment) are recorded in the header area 31. It has become so. That is, in the header area 31, the diagnosis date and time (recording start date and time, recording end date and time) and the doctor's name are recorded for each file.

【００３９】図３に戻って、使用者は、ディジタルレコ
ーダ１への録音を終えると次にＰＣカードアダプタ３を
介してミニチュアカード２をパーソナルコンピュータ４
に装填する（ステップＳ２）。これにより、録音された
音声データファイルがパーソナルコンピュータ４に転送
される（ステップＳ２１）。Returning to FIG. 3, after finishing recording on the digital recorder 1, the user then inserts the miniature card 2 into the personal computer 4 via the PC card adapter 3.
(Step S2). Thereby, the recorded voice data file is transferred to the personal computer 4 (step S21).

【００４０】次に、使用者がパーソナルコンピュータ４
を操作して制御プログラム８および診断書作成プログラ
ム（音声認識プログラム９及び文章作成プログラム１
０）を立ち上げる（ステップＳ３）と、ディスプレイ５
には制御ウィンドウ１０１および診断書作成ウィンドウ
１０２が表示される（ステップＳ３１）。Next, the user operates the personal computer 4.
To operate the control program 8 and the medical certificate creation program (the speech recognition program 9 and the sentence creation program 1)
0) (Step S3), the display 5
Displays a control window 101 and a medical certificate creation window 102 (step S31).

【００４１】次に使用者はマウス７を操作して、図５に
示すように、「患者氏名」が録音された音声データファ
イルｆｉｌｅ１に係るアイコン１３ａを選択して診断書
作成ウィンドウ１０２の患者氏名入力領域２３にドラッ
グ・アンド・ドロップする（ステップＳ４）。これによ
りＣＰＵ（パーソナルコンピュータ４に内設）はまず、
当該音声データファイルｆｉｌｅ１のヘッダ領域に記録
されている診断日時（録音開始日時、録音終了日時）
や、医師氏名の情報を読み出し、各々の情報を診断書作
成ウィンドウ１０２の診断日時表示領域２１、医師氏名
表示領域２２に表示する（ステップＳ４１）。Next, the user operates the mouse 7 to select the icon 13a corresponding to the voice data file file1 in which the "patient name" is recorded as shown in FIG. Drag and drop to the input area 23 (step S4). This allows the CPU (internal to the personal computer 4) to first
Diagnosis date and time (recording start date and time, recording end date and time) recorded in the header area of the audio data file file1
Alternatively, the information of the doctor's name is read, and each information is displayed in the diagnosis date / time display area 21 and the doctor's name display area 22 of the medical certificate creation window 102 (step S41).

【００４２】続いてＣＰＵは、音声データファイルｆｉ
ｌｅ１の音声データに対して音声認識処理を行い、その
認識結果を診断書作成ウィンドウ１０２の患者氏名入力
領域２３に表示する（ステップＳ４２）。Subsequently, the CPU sets the audio data file fi
The voice recognition processing is performed on the voice data of le1, and the recognition result is displayed in the patient name input area 23 of the medical certificate creation window 102 (step S42).

【００４３】次に使用者はマウス７を操作して、図６に
示すように、「疾病名」が録音された音声データファイ
ルｆｉｌｅ２を選択して診断書作成ウィンドウ１０２の
疾病名入力領域２４にドラッグ・アンド・ドロップする
（ステップＳ５）。これによりＣＰＵは、音声データフ
ァイルｆｉｌｅ２の音声データに対して音声認識処理を
行い、その認識結果を診断書作成ウィンドウ１０２の疾
病名入力領域２４に表示する（ステップＳ５１）。Next, the user operates the mouse 7 to select the voice data file file2 in which the "disease name" is recorded as shown in FIG. Drag and drop (step S5). As a result, the CPU performs voice recognition processing on the voice data of the voice data file file2, and displays the recognition result in the disease name input area 24 of the medical certificate creation window 102 (step S51).

【００４４】次に使用者はマウス７を操作して、図７に
示すように、「発病から初診までの経過」についての内
容が録音された音声データファイルｆｉｌｅ３を選択し
て診断書作成ウィンドウ１０２の発病から初診までの経
過入力領域２５にドラッグ・アンド・ドロップする（ス
テップＳ６）。これによりＣＰＵは、音声データファイ
ルｆｉｌｅ３の音声データに対して音声認識処理を行
い、その認識結果を診断書作成ウィンドウ１０２の発病
から初診までの経過入力領域２５に表示する（ステップ
Ｓ６１）。Next, the user operates the mouse 7 to select the voice data file file3 in which the contents of "the progress from the onset of the disease to the first consultation" are recorded as shown in FIG. Is dragged and dropped into the progress input area 25 from the onset of the disease to the first consultation (step S6). As a result, the CPU performs voice recognition processing on the voice data of the voice data file file3, and displays the recognition result in the progress input area 25 from the onset to the first consultation in the medical certificate creation window 102 (step S61).

【００４５】次に使用者はマウス７を操作して、図８に
示すように、「所見および経過」についての内容が録音
されたｆｉｌｅ４を選択して診断書作成ウィンドウ１０
２の所見および経過入力領域２６にドラッグ・アンド・
ドロップする（ステップＳ７）。これによりＣＰＵは、
ｆｉｌｅ４の音声データに対して音声認識処理を行い、
その認識結果を診断書作成ウィンドウ１０２の所見およ
び経過入力領域２６に表示する（ステップＳ７１）。Next, the user operates the mouse 7 to select the file 4 in which the contents of "findings and progress" are recorded as shown in FIG.
2 and drag and drop to the finding and progress input area 26
Drop (step S7). This allows the CPU
performs voice recognition processing on the voice data of file4,
The recognition result is displayed in the finding and progress input area 26 of the medical certificate creation window 102 (step S71).

【００４６】このように本第１の実施形態の文書作成装
置では、まず音声記録装置から転送された音声データフ
ァイルのうち所望のファイルを選択すると共に、当該フ
ァイルをテキスト文章として再現する定型文章フォーマ
ット上の所定領域を選択すると、上記選択された音声デ
ータファイルに係る音声データに対して音声認識処理を
施し、且つテキスト文字情報として文章化することがで
き、大変使い勝手の良い文書作成装置を提供することが
できる。As described above, in the document creating apparatus according to the first embodiment, first, a desired file is selected from the audio data files transferred from the audio recording apparatus, and the file is reproduced as a text document in a fixed text format. When the above-mentioned predetermined area is selected, the voice data relating to the selected voice data file is subjected to voice recognition processing and can be converted into text as text character information, thereby providing a very easy-to-use document creation device. be able to.

【００４７】なお、本第１の実施形態では、ｆｉｌｅ１
のヘッダ領域３１に記録されている録音開始日時（診断
日時情報）およびユーザ情報（医師氏名情報）を読み出
し、各々を診断書作成ウィンドウ１０２の診断日時表示
領域２１および医師氏名表示領域２２に表示するように
したが、これはｆｉｌｅ１に限らず別の音声データファ
イルから読み出すようにしてもよい。In the first embodiment, file1
The recording start date and time (diagnosis date and time information) and the user information (doctor name information) recorded in the header area 31 are read out and displayed in the diagnosis date and time display area 21 and the doctor name display area 22 of the medical certificate creation window 102, respectively. However, this is not limited to file 1 and may be read from another audio data file.

【００４８】また、上記ステップＳ４〜Ｓ７（図３参
照）までの処理の順番は上述した順番に限ったものでは
なく、使用者が自由に順番を変えて操作しても同様の効
果をもたらすことは言うまでもない。Further, the order of the processes in steps S4 to S7 (see FIG. 3) is not limited to the order described above, and the same effect can be obtained even if the user freely changes the order and operates. Needless to say.

【００４９】次に、本発明の第２実施形態の文書作成装
置について説明する。本第２の実施形態の文書作成装置
は、図１に示す限りその構成は上記第１の実施形態と同
様である。また、上記第１実施形態の文書作成装置で
は、音声データファイルとして「患者氏名」に係るｆｉ
ｌｅ１、「疾病名」に係るｆｉｌｅ２、「発病から初診
までの経過」に係るｆｉｌｅ３、「所見および経過」に
係るｆｉｌｅ４を設定し、これら音声データファイル毎
に任意のテキスト入力領域において音声認識させてテキ
スト文章化することを特徴とするものであったが、本第
２の実施形態では、各音声データファイル中の任意の区
間について選択し、この選択された区間のみをテキスト
入力領域において音声認識させてテキスト文章化するこ
とを特徴とする。その他の構成、作用については上記第
１の実施形態と同様である。Next, a document creation device according to a second embodiment of the present invention will be described. The configuration of the document creation device of the second embodiment is the same as that of the first embodiment as shown in FIG. Further, in the document creation device of the first embodiment, the fifteen related to the “patient name” is used as the voice data file.
le1, file2 relating to "disease name", file3 relating to "the progress from the onset to the first consultation", and file4 relating to "findings and progress", and performing voice recognition in an arbitrary text input area for each of these voice data files. In the second embodiment, an arbitrary section in each voice data file is selected, and only the selected section is subjected to voice recognition in the text input area. To make text sentences. Other configurations and operations are the same as those in the first embodiment.

【００５０】図９は、本第２の実施形態の文書作成装置
において、制御プログラム８、診断書作成プログラム
（音声認識プログラム９、文章作成プログラム１０）を
起動したときのディスプレイ５の一表示例を示した説明
図である。FIG. 9 shows a display example of the display 5 when the control program 8 and the diagnostic certificate creation program (the speech recognition program 9 and the sentence creation program 10) are activated in the document creation device of the second embodiment. FIG.

【００５１】図９に示すように、本第２の実施形態で
は、上記第１の実施形態でも表示される制御ウィンドウ
１０１内の、いわゆるツールバー１２にあらたに波形表
示ボタン１６を設けている。この波形表示ボタン１６が
クリックされると、当該制御ウィンドウ１０１とは別に
音声データ波形ウィンドウ１０３が開かれ、ファイル一
覧表示部１３で選択されている音声データファイルｆｉ
ｌｅＸに係る音声データの時間軸波形が表示される。As shown in FIG. 9, in the second embodiment, a waveform display button 16 is newly provided on the so-called toolbar 12 in the control window 101 also displayed in the first embodiment. When the waveform display button 16 is clicked, an audio data waveform window 103 is opened separately from the control window 101, and the audio data file fi selected in the file list display section 13 is opened.
The time axis waveform of the audio data related to leX is displayed.

【００５２】さらに、上記音声データ波形ウィンドウ１
０３に表示された時間軸音声データ波形のうち、マウス
７等のポインティングデバイスを用いて任意の区間を選
択することができる。Further, the audio data waveform window 1
An arbitrary section can be selected from the time axis audio data waveform displayed at 03 by using a pointing device such as the mouse 7.

【００５３】ここで、時間軸音声データ波形のうち任意
の区間を選択することの意義について簡単に述べる。本
実施形態の如き文書作成装置は、再現しようとする音声
データは使用者自ら録音した場合であることが多いと想
定されること、また、各音声データファイルに記録され
る音声データは、比較的短時間であると想定されること
を考慮すると、使用者にとって時間軸音声データ波形の
様子を見るだけで再現を所望する音声データの区間を読
み取ることはさして困難なことではない。むしろ波形情
報として視覚に訴えることで比較的容易に所望の音声デ
ータを探し当てることができると考えられる。Here, the significance of selecting an arbitrary section from the time axis audio data waveform will be briefly described. In the document creation apparatus according to the present embodiment, it is assumed that the audio data to be reproduced is often recorded by the user himself, and the audio data recorded in each audio data file is relatively Considering that it is assumed to be a short time, it is not very difficult for the user to read the section of the audio data desired to be reproduced only by watching the state of the time axis audio data waveform. Rather, it is considered that desired audio data can be found relatively easily by appealing visually as waveform information.

【００５４】このように、本実施形態では、録音された
音声データファイルを全て選択して再現するのではな
く、再現を所望する区間を、波形情報を見ただけで選択
し再現することができる。As described above, in the present embodiment, a section desired to be reproduced can be selected and reproduced only by looking at the waveform information, instead of selecting and reproducing all the recorded audio data files. .

【００５５】使用者は、マウス７等のポインティングデ
バイスを用いて上述のように所望の区間を選択し、さら
に選択した区間をドラッグして任意のテキスト入力領
域、例えば、所見および経過入力領域２６にドロップす
る。これにより、パーソナルコンピュータ４のＣＰＵ
は、当該選択区間の音声データについて音声認識処理を
施し、該当するテキスト文章を表示する。The user selects a desired section using the pointing device such as the mouse 7 as described above, and drags the selected section to an arbitrary text input area, for example, a finding and progress input area 26. Drop. Thereby, the CPU of the personal computer 4
Performs voice recognition processing on the voice data of the selected section, and displays a corresponding text sentence.

【００５６】その他の構成、作用については上記第１の
実施形態と同様であるので、ここでの詳しい説明は省略
する。The other configuration and operation are the same as those of the first embodiment, and the detailed description is omitted here.

【００５７】このように本第２の実施形態の文書作成装
置では、第１の実施形態の効果に加えて、選択した音声
データファイルのうち再現を所望する区間を、音声デー
タ波形を見ながら選択できるので、より的確なテキスト
文章を得ることができる。As described above, in the document creating apparatus according to the second embodiment, in addition to the effects of the first embodiment, a section desired to be reproduced in the selected audio data file is selected while looking at the audio data waveform. It is possible to obtain more accurate text sentences.

【００５８】次に、本発明の第３実施形態の文書作成装
置について説明する。本第３の実施形態の文書作成装置
は、図１に示す限りその構成は上記第１の実施形態と同
様である。また、上記第１実施形態の文書作成装置で
は、音声データファイルとして「患者氏名」に係るｆｉ
ｌｅ１、「疾病名」に係るｆｉｌｅ２、「発病から初診
までの経過」に係るｆｉｌｅ３、「所見および経過」に
係るｆｉｌｅ４を設定し、これら音声データファイル毎
に任意のテキスト入力領域において音声認識させてテキ
スト文章化することを特徴とするものであったが、本第
３の実施形態では、各音声データファイル中に付与した
インデックス情報に基づいて所定の区間をテキスト入力
領域において音声認識させてテキスト文章化することを
特徴とする。その他の構成、作用については上記第１の
実施形態と同様である。Next, a document creating apparatus according to a third embodiment of the present invention will be described. The configuration of the document creation apparatus of the third embodiment is the same as that of the first embodiment as shown in FIG. Further, in the document creation device of the first embodiment, the fifteen related to the “patient name” is used as the voice data file.
le1, file2 relating to "disease name", file3 relating to "the progress from the onset to the first consultation", and file4 relating to "findings and progress", and performing voice recognition in an arbitrary text input area for each of these voice data files. However, in the third embodiment, a predetermined section is made to be speech-recognized in the text input area based on the index information given in each audio data file. It is characterized in that Other configurations and operations are the same as those in the first embodiment.

【００５９】ところで、音声データの検索性を高めるた
めに音声データにインデックス情報を記録する技術はよ
く知られているところである。本第３の実施形態は、か
かる技術に鑑みてなされたものである。By the way, a technique for recording index information in audio data in order to enhance the searchability of audio data is well known. The third embodiment has been made in view of such technology.

【００６０】図１０は、本第３の実施形態の文書作成装
置において、制御プログラム８、診断書作成プログラム
（音声認識プログラム９、文章作成プログラム１０）を
起動したときのディスプレイ５の一表示例を示した説明
図である。FIG. 10 shows a display example of the display 5 when the control program 8 and the diagnostic certificate creation program (the speech recognition program 9 and the text creation program 10) are activated in the document creation device of the third embodiment. FIG.

【００６１】本第３の実施形態の文書作成装置に用いる
ディジタルレコーダ１には、図示しないインデックスマ
ーク釦が設けられており、録音中または再生中に該イン
デックスマーク釦を押すと、その時点のアドレス値がヘ
ッダ領域のインデックス領域に記録されるようになって
いる。なお、各インデックス領域の初期値として、特定
の値（例えばＦＦＦＦH ）が記録されているものとし、
その特定の値であればインデックスマークは記録されて
いないと判断することができる。The digital recorder 1 used in the document creating apparatus according to the third embodiment is provided with an index mark button (not shown). When the index mark button is pressed during recording or playback, the address at that time is read. The value is recorded in the index area of the header area. It is assumed that a specific value (for example, FFFFH) is recorded as an initial value of each index area,
If the value is the specific value, it can be determined that the index mark is not recorded.

【００６２】また図１０に示すように、本第３の実施形
態では、上記第１の実施形態でも表示される制御ウィン
ドウ１０１内の、再生表示部１４にインデックスマーク
表示１７が表示されるようになっている。As shown in FIG. 10, in the third embodiment, the index mark display 17 is displayed on the reproduction display section 14 in the control window 101 also displayed in the first embodiment. Has become.

【００６３】なお、インデックスマークの追加、削除
は、当該制御ウィンドウ１０１の画面上においても行う
ことができるようになっている。図１１は、本第３の実
施形態の文書作成装置において、メニューバー１３の
「編集」メニューを選択して「インデックスマークの追
加」を選択した場合の例を示した説明図である。「イン
デックスマークの追加」を実行した場合、その時の再生
位置を示すアドレスが、インデックスマークアドレスと
してインデックス領域に記録される。The addition and deletion of index marks can be performed on the screen of the control window 101. FIG. 11 is an explanatory diagram showing an example of a case where the “edit” menu on the menu bar 13 is selected and “add index mark” is selected in the document creating apparatus according to the third embodiment. When the “addition of index mark” is executed, an address indicating the reproduction position at that time is recorded in the index area as an index mark address.

【００６４】図１２〜図１４は、本実施形態の文書作成
装置において、音声認識対象区間の設定手法を説明する
図である。FIGS. 12 to 14 are diagrams for explaining a method of setting a speech recognition target section in the document creation apparatus according to the present embodiment.

【００６５】いま、１つの音声データファイル中に「患
者氏名」、「疾病名」、「発病から初診までの経過」、
「所見および経過」についてのメッセージが録音されて
おり、各内容の間にインデックスマークＡ、Ｂ、Ｃが記
録されているとする。このとき、再生表示部は図１２の
ように表示される。Now, in one voice data file, “patient name”, “disease name”, “elapsed time from onset to first consultation”,
It is assumed that a message about "findings and progress" is recorded, and index marks A, B, and C are recorded between the contents. At this time, the reproduction display section is displayed as shown in FIG.

【００６６】ここで、例えば、「発病から初診までの経
過」についての内容の音声認識処理を行いたいときは、
マウス７等のポインティングデバイスを操作して、その
「発病から初診までの経過」についてのメッセージのあ
るインデックスマークＢ、Ｃで挟まれた区間にマウスポ
インタ１８を合わせる。そして、このマウスポインタ１
８を合わせた状態でマウス７をクリックすると、図１３
に示すように、当該区間が選択表示される。なお、この
選択表示は、再生表示部１４上で反転表示されるもので
もよいし、他の色で表示されるものでもよい。図１４
は、「所見および経過」についてのメッセージがある区
間を選択した例である。Here, for example, when it is desired to perform the speech recognition processing of the content of “the progress from the onset of the disease to the first consultation”,
By operating a pointing device such as the mouse 7, the mouse pointer 18 is set to a section sandwiched between index marks B and C with a message about “the progress from the onset to the first consultation”. And this mouse pointer 1
When the mouse 7 is clicked while the mouse 8 is aligned, FIG.
As shown in the figure, the section is selectively displayed. The selection display may be reversed on the reproduction display unit 14 or may be displayed in another color. FIG.
Is an example in which a section having a message about “findings and progress” is selected.

【００６７】このように音声認識の対象区間を選択した
状態で、診断書作成ウィンドウ１０２の該当する入力領
域にドラッグ・アンド・ドロップすると、当該選択され
た区間に対して音声認識処理が行われ、テキストが表示
される。When the target section for speech recognition is selected and dragged and dropped into the corresponding input area of the medical certificate creation window 102, the speech recognition processing is performed on the selected section. The text is displayed.

【００６８】その他の構成、作用については上記第１の
実施形態と同様であるので、ここでの詳しい説明は省略
する。The other configuration and operation are the same as those of the first embodiment, so that the detailed description is omitted here.

【００６９】このように本第３の実施形態の文書作成装
置では、第１の実施形態の効果に加えて、選択した音声
データファイルのうち再現を所望する区間を、インデッ
クス情報に基づき選択できるので、より的確なテキスト
文章を得ることができる。As described above, in the document creating apparatus according to the third embodiment, in addition to the effects of the first embodiment, a section desired to be reproduced can be selected from the selected audio data file based on the index information. , You can get more accurate text sentences.

【００７０】また、上述した各実施形態においては、テ
キスト表示する対象音声データとして、デジタルレコー
ダを使用して予め録音した音声データを例に挙げたが、
これに限ることなく、マイクロホンを直接パーソナルコ
ンピュータに接続して録音した音声データを利用しても
良い。In each of the above-described embodiments, audio data recorded in advance using a digital recorder has been described as an example of audio data to be displayed as text.
The present invention is not limited to this, and sound data recorded by directly connecting a microphone to a personal computer may be used.

【００７１】このように上記各実施形態によれば、任意
の音声データまたはその任意の区間を指定して、定型文
書フォーマット上の任意のテキスト入力領域で音声認識
を行い、テキスト化することを可能とする、使い勝手の
よい文書作成装置を提供することができる。As described above, according to each of the above-described embodiments, it is possible to designate any voice data or any section thereof, perform voice recognition in any text input area in the standard document format, and convert it to text. Thus, an easy-to-use document creation device can be provided.

【００７２】[付記]以上詳述した如き本発明の実施形態
によれば、以下の如き構成を得ることができる。即ち、（１）複数の文書入力領域を規定した所定のフォーマッ
トを表示する第１表示画面（診断書作成ウィンドウ１０
２）と、音声データを記録した複数の音声ファイルの識
別子を表示する第２表示画面（制御ウィンドウ１０１）
と、上記第２表示画面に表示した音声ファイルの一つを
選択し、この選択情報を上記第１表示画面に表示した文
書入力領域の一つに入力する操作手段（マウス７）と、
を具備し、上記操作手段により選択された音声ファイル
に記録された音声データを文字に変換する音声認識処理
を行い、音声認識処理された文字を上記第１表示画面の
所定の文書入力領域に表示することを特徴とする文書作
成装置。[Appendix] According to the embodiment of the present invention as described in detail above, the following configuration can be obtained. That is, (1) a first display screen (a medical certificate creation window 10) for displaying a predetermined format defining a plurality of document input areas.
2) and a second display screen (control window 101) for displaying identifiers of a plurality of audio files in which audio data is recorded.
Operating means (mouse 7) for selecting one of the audio files displayed on the second display screen and inputting the selected information to one of the document input areas displayed on the first display screen;
Performing voice recognition processing for converting voice data recorded in the voice file selected by the operation means into characters, and displaying the voice-recognized characters in a predetermined document input area of the first display screen. A document creation apparatus characterized by performing the following.

【００７３】（２）複数の文書入力領域を規定した所定
のフォーマットを表示する第１表示画面（診断書作成ウ
ィンドウ１０２）と、音声データの時間軸波形を表示す
る第２表示画面（音声データ波形ウィンドウ１０３）
と、上記第２表示画面に表示した音声データの時間軸波
形の所定領域を選択し、この選択情報を上記第１表示画
面に表示した文書入力領域の一つに入力する操作手段
（マウス７）と、を具備し、上記操作手段により選択さ
れた所定領域に記録された音声データを文字に変換する
音声認識処理を行い、音声認識処理された文字を上記第
１表示画面の所定の文書入力領域に表示することを特徴
とする文書作成装置。(2) A first display screen (diagnostic report creation window 102) for displaying a predetermined format defining a plurality of document input areas, and a second display screen (audio data waveform) for displaying a time axis waveform of audio data. Window 103)
Operating means (mouse 7) for selecting a predetermined area of the time axis waveform of the audio data displayed on the second display screen and inputting this selection information to one of the document input areas displayed on the first display screen And performs voice recognition processing for converting voice data recorded in a predetermined area selected by the operation means into characters, and converts the voice-recognized characters into a predetermined document input area on the first display screen. A document creation apparatus characterized by displaying on a document.

【００７４】（３）複数の文書入力領域を規定した所定
のフォーマットを表示する第１表示画面（診断書作成ウ
ィンドウ１０２）と、音声データを記録した音声ファイ
ルに付与したインデックス情報を表示する第２表示画面
（制御ウィンドウ１０１）と、上記第２表示画面に表示
したインデックス情報に基づいて、上記音声データの所
定領域を選択し、この選択情報を上記第１表示画面に表
示した文書入力領域の一つに入力する操作手段（マウス
７）と、を具備し、上記操作手段により選択された所定
領域に記録された音声データを文字に変換する音声認識
処理を行い、音声認識処理された文字を上記第１表示画
面の所定の文書入力領域に表示することを特徴とする文
書作成装置。(3) A first display screen (diagnostic report creation window 102) for displaying a predetermined format defining a plurality of document input areas, and a second display for displaying index information added to a voice file in which voice data is recorded. A predetermined area of the audio data is selected based on the display screen (control window 101) and the index information displayed on the second display screen, and the selected information is selected from the document input area displayed on the first display screen. Operating means (mouse 7) for inputting the voice data, performing voice recognition processing for converting voice data recorded in the predetermined area selected by the operating means into characters, and converting the voice-recognized characters to the above-mentioned characters. A document creation device for displaying in a predetermined document input area of a first display screen.

【００７５】（４）複数の文書入力領域を規定した所定
のフォーマットを表示する第１表示画面（診断書作成ウ
ィンドウ１０２）と、音声データの区分情報を表示する
第２表示画面（制御ウィンドウ１０１、音声データ波形
ウィンドウ１０３）と、上記第２表示画面に表示した区
分情報に基づいて、上記音声データの所定領域を選択
し、この選択情報を上記第１表示画面に表示した文書入
力領域の一つに入力する操作手段（マウス７）と、を具
備し、上記操作手段により選択された所定領域に記録さ
れた音声データを文字に変換する音声認識処理を行い、
音声認識処理された文字を上記第１表示画面の所定の文
書入力領域に表示することを特徴とする文書作成装置。(4) A first display screen (diagnostic report creation window 102) for displaying a predetermined format defining a plurality of document input areas, and a second display screen (control window 101, Based on the audio data waveform window 103) and the division information displayed on the second display screen, a predetermined area of the audio data is selected, and this selected information is selected as one of the document input areas displayed on the first display screen. And a voice recognition process for converting voice data recorded in a predetermined area selected by the operation means into characters.
A document creation apparatus, wherein a character subjected to voice recognition processing is displayed in a predetermined document input area of the first display screen.

【００７６】[0076]

【発明の効果】以上説明したように本発明によれば、任
意の音声データに音声認識処理を施すと共に所定の領域
に文字情報として表示でき、使い勝手の良い文書作成装
置及び文書作成処理プログラムを記録した記録媒体を提
供することができる。As described above, according to the present invention, it is possible to perform a voice recognition process on arbitrary voice data and display it as character information in a predetermined area, and to record a user-friendly document preparation device and a document preparation processing program. Recording medium can be provided.

[Brief description of the drawings]

【図１】本発明の第１の実施形態である文書作成装置の
概念的な全体構成を示した説明図である。FIG. 1 is an explanatory diagram showing a conceptual overall configuration of a document creation device according to a first embodiment of the present invention.

【図２】上記第１の実施形態の文書作成装置において、
制御プログラム、診断書作成プログラム（音声認識プロ
グラム、文章作成プログラム）を起動したときのディス
プレイの一表示例を示した説明図である。FIG. 2 is a diagram illustrating the document creation apparatus according to the first embodiment;
FIG. 4 is an explanatory diagram showing a display example of a display when a control program and a medical certificate creating program (a voice recognition program and a sentence creating program) are activated.

【図３】上記第１の実施形態の文書作成装置において、
診断書を作成する手法を示したフローチャートである。FIG. 3 is a diagram illustrating the document creating apparatus according to the first embodiment;
5 is a flowchart illustrating a method for creating a medical certificate.

【図４】上記第１の実施形態の文書作成装置において、
各音声データファイルの音声メモリの記録領域を示した
説明図である。FIG. 4 is a diagram illustrating the document creating apparatus according to the first embodiment;
FIG. 3 is an explanatory diagram showing a recording area of an audio memory of each audio data file.

【図５】上記第１の実施形態の文書作成装置において、
「患者氏名」が録音された音声データファイルを選択し
て患者氏名入力領域にドラッグ・アンド・ドロップする
様子を示した説明図である。FIG. 5 is a diagram illustrating the document creating apparatus according to the first embodiment.
FIG. 11 is an explanatory diagram showing a state in which an audio data file in which a “patient name” is recorded is dragged and dropped into a patient name input area.

【図６】上記第１の実施形態の文書作成装置において、
「疾病名」が録音された音声データファイルを選択して
患者氏名入力領域にドラッグ・アンド・ドロップする様
子を示した説明図である。FIG. 6 is a diagram illustrating the document creation apparatus according to the first embodiment;
FIG. 11 is an explanatory diagram showing a state where a voice data file in which a “disease name” is recorded is dragged and dropped into a patient name input area.

【図７】上記第１の実施形態の文書作成装置において、
「発病から初診までの経過」が録音された音声データフ
ァイルを選択して患者氏名入力領域にドラッグ・アンド
・ドロップする様子を示した説明図である。FIG. 7 is a diagram illustrating the document creation apparatus according to the first embodiment;
FIG. 11 is an explanatory diagram showing a state in which a voice data file in which “the progress from the onset of disease to the first consultation” is recorded is dragged and dropped into a patient name input area.

【図８】上記第１の実施形態の文書作成装置において、
「所見および経過」が録音された音声データファイルを
選択して患者氏名入力領域にドラッグ・アンド・ドロッ
プする様子を示した説明図である。FIG. 8 is a diagram illustrating the document creation apparatus according to the first embodiment;
FIG. 11 is an explanatory diagram showing a state in which a voice data file in which “findings and progress” are recorded is dragged and dropped into a patient name input area.

【図９】本発明の第２の実施形態の文書作成装置におい
て、制御プログラム、診断書作成プログラム（音声認識
プログラム、文章作成プログラム）を起動したときのデ
ィスプレイの一表示例を示した説明図である。FIG. 9 is an explanatory diagram showing a display example of a display when a control program and a medical certificate creation program (a speech recognition program and a text creation program) are activated in the document creation device according to the second embodiment of the present invention. is there.

【図１０】本発明の第３の実施形態の文書作成装置にお
いて、制御プログラム、診断書作成プログラム（音声認
識プログラム、文章作成プログラム）を起動したときの
ディスプレイの一表示例を示した説明図である。FIG. 10 is an explanatory diagram showing one display example of a display when a control program and a diagnostic certificate creation program (a speech recognition program and a text creation program) are activated in the document creation device according to the third embodiment of the present invention. is there.

【図１１】上記第３の実施形態の文書作成装置におい
て、制御ウィンドウ上でインデックスマークの追加、削
除を行う際の様子を示した説明図である。FIG. 11 is an explanatory diagram showing a state when adding or deleting an index mark on a control window in the document creating apparatus according to the third embodiment.

【図１２】上記第３の実施形態の文書作成装置におい
て、音声認識対象区間の設定手法を説明する図である。FIG. 12 is a diagram illustrating a method for setting a speech recognition target section in the document creation device according to the third embodiment.

【図１３】上記第３の実施形態の文書作成装置におい
て、音声認識対象区間の設定手法を説明する図である。FIG. 13 is a diagram illustrating a method for setting a speech recognition target section in the document creation device according to the third embodiment.

【図１４】上記第３の実施形態の文書作成装置におい
て、音声認識対象区間の設定手法を説明する図である。FIG. 14 is a diagram illustrating a method for setting a speech recognition target section in the document creation device according to the third embodiment.

[Explanation of symbols]

１…ディジタルレコーダ２…ミニチュアカード３…ＰＣカードアダプタ４…パーソナルコンピュータ５…ディスプレイ６…キーボード７…マウス８…制御プログラム９…音声認識プログラム１０…文章作成プログラム１３…ファイル一覧表示部１３ａ…アイコン１０１…制御ウィンドウ１０２…診断書作成ウィンドウ１０３…音声データ波形ウィンドウ DESCRIPTION OF SYMBOLS 1 ... Digital recorder 2 ... Miniature card 3 ... PC card adapter 4 ... Personal computer 5 ... Display 6 ... Keyboard 7 ... Mouse 8 ... Control program 9 ... Speech recognition program 10 ... Sentence creation program 13 ... File list display section 13a ... Icon 101 Control window 102: Medical certificate creation window 103: Voice data waveform window

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/28 Ｇ１０Ｌ 3/00 ５５１Ｐ５７１Ｋ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 15/28 G10L 3/00 551P 571K

Claims

[Claims]

1. A document creation apparatus for performing document creation processing by a programmed computer, comprising: a document editing window display means for displaying a document editing window including at least a set of predetermined text input areas; and an audio data file. A voice data file display window displaying means for displaying a voice data file display window including the icon according to the above, and at least a pointing device for selecting a voice data file, the voice data file displayed in the voice data file display window When any of the icons is selected and dragged and dropped to any text input area displayed in the document editing window, the audio data associated with the audio data file corresponding to the selected icon is displayed. Document creating apparatus characterized by comprising: a voice recognition processing means for performing voice recognition processing, a voice recognition result display means for the converted text by the speech recognition processing means for displaying on the text input region.

2. A recording medium storing a program for performing a document creation process by a computer, wherein a document editing window display function for displaying a document editing window including at least a set of predetermined text input areas, An audio data file display window display function for displaying an audio data file display window including an icon relating to a data file; and a voice data file displayed in the audio data file display window by at least a pointing device for selecting an audio data file. When an arbitrary icon is selected from among the icons according to and the drag-and-drop operation is performed on an arbitrary text input area displayed in the document editing window, the audio data relating to the audio data file corresponding to the selected icon is displayed. A voice recognition processing function for performing speech recognition processing on the data, a recording medium the converted text by the speech recognition processing function recording a program for realizing the voice recognition result display function of displaying on the text entry area.

3. A document creation apparatus for performing document creation processing by a programmed computer, comprising: a document editing window display means for displaying a document editing window including at least a set of predetermined text input areas; Audio data waveform window display means for displaying an audio data waveform window for displaying a time axis waveform of audio recorded as audio data, and a voice device displayed in the audio data waveform window by at least a pointing device for selecting an audio data file. When an arbitrary waveform area is selected from the time axis waveforms and drag-and-drop operation is performed on an arbitrary text input area displayed in the document editing window, voice recognition corresponding to the selected waveform area is performed. Voice recognition processing means for performing processing; A speech recognition result display unit for displaying the text converted by the speech recognition processing unit in the text input area.

4. A recording medium on which a program for performing a document creation process by a computer is recorded, wherein a document editing window display function for displaying a document editing window including at least a predetermined set of text input areas; An audio data waveform window display function for displaying an audio data waveform window for displaying a time axis waveform of audio recorded as a data file, and a pointing device for selecting at least an audio data file, are displayed in the audio data waveform window. When an arbitrary waveform area is selected from the time axis waveform of the audio and dragged and dropped to an arbitrary text input area displayed in the document editing window, audio data corresponding to the selected waveform area is generated. Voice recognition processing function to perform voice recognition processing and Recording medium for recording a program for realizing the voice recognition result display function of displaying on the text input area the converted text by the speech recognition processing function.

5. A document creation apparatus for performing document creation processing by a programmed computer, comprising: a document editing window display means for displaying a document editing window including at least a predetermined set of text input areas; and an audio data file. A voice data file display window displaying means for displaying a voice data file display window including the index information given to the voice data file, and at least a pointing device for selecting a voice data file. When an arbitrary voice data area is selected based on the selected voice data area and a drag and drop operation is performed on an arbitrary text input area displayed in the document editing window, voice recognition processing is performed on the voice data corresponding to the selected voice data area. Document creation device, wherein the speech recognition processing unit, and the speech recognition result display means for the converted text by the speech recognition processing means for displaying on the text entry area, further comprising a performing.

6. A recording medium on which a program for performing a document creation process by a computer is recorded, wherein a document editing window display function for displaying a document editing window including at least a predetermined set of text input areas, An audio data file display window display function for displaying an audio data file display window including index information assigned to the data file, and an index displayed in the audio data file display window by at least a pointing device for selecting an audio data file. When an arbitrary voice data area is selected based on the information and a drag-and-drop operation is performed on an arbitrary text input area displayed in the document editing window, voice recognition is performed on the voice data corresponding to the selected voice data area. A voice recognition processing function for performing processing, a recording medium that the converted text by the speech recognition processing function recording a program for realizing the voice recognition result display function of displaying on the text entry area.