JPH07152756A

JPH07152756A - Document processor

Info

Publication number: JPH07152756A
Application number: JP5299302A
Authority: JP
Inventors: Tomonori Kudou; 朋紀工藤
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1993-11-30
Filing date: 1993-11-30
Publication date: 1995-06-16

Abstract

PURPOSE:To easily execute maintenance and to reduce capacity by collecting dictionaries for respective purposes to one dictionary in processing such as KANA (Japanese syllabary)/KANJI (Chinese character) conversion, voice synthesis and voice recognition requiring dictionaries. CONSTITUTION:In the case of executing KANA/KANJI conversion, reading is inputted, notation corresponding to the reading is retrieved from a dictionary reading part 33 and a KANA sentence is converted into a KANA/KANJI mixed sentence. At the time of voice synthesis, a notation string is inputted, voice data corresponding to the input are retrieved from a notation part 34 and voice data are read out and outputted as voice. In the case of executing voice recognition, voice data are inputted and notation corresponding to the input is retrieved from a voice part 35 to convert the voice data into the notation. In the case of executing dictionary maintenance, any one of reading, notation and voice is inputted and a record corresponding to the input is adopted as an object to be maintained. Thereby the consistency of contents in the dictionary can be maintained and labor and time for the maintenance of the dictionary is made unnecessary.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、例えばかな漢字変換
等、辞書を用いた文書処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document processing device using a dictionary, such as kana-kanji conversion.

【０００２】[0002]

【従来の技術】従来、かな漢字変換処理等、辞書を用い
て遂行される処理は幅広く行われている。このような処
理においては、例えばかな漢字変換にはそのための辞書
が必要であり、また、音声合成や音声認識を行なうので
あれば、音声合成や音声認識用辞書といったように、処
理にあわせた別々の辞書が必要であった。2. Description of the Related Art Conventionally, processing performed using a dictionary such as kana-kanji conversion processing has been widely performed. In such a process, for example, a kana-kanji conversion requires a dictionary for that purpose, and if voice synthesis or voice recognition is to be performed, a separate dictionary suitable for the process, such as a voice synthesis or voice recognition dictionary, is used. I needed a dictionary.

【０００３】[0003]

【発明が解決しようとする課題】このように前述した従
来の処理では、処理に応じた辞書が必要であったため、
辞書を格納するのに必要とされる記憶容量が大きくなる
という問題があった。また、例えば、かな漢字変換や音
声合成、音声認識といった同じ言語を扱う処理に用いる
辞書の場合には、各辞書に同じ単語が登録されているこ
とが多く、１つの登録内容を１つの辞書について修正す
ると、すべての辞書を修正しなければならないなど、辞
書の保守が容易ではなかった。As described above, the conventional processing described above requires a dictionary corresponding to the processing.
There is a problem that the storage capacity required to store the dictionary becomes large. Further, for example, in the case of dictionaries used for processing the same language such as kana-kanji conversion, voice synthesis, and voice recognition, the same word is often registered in each dictionary, and one registered content is corrected for one dictionary. Then, all the dictionaries had to be modified, and maintenance of the dictionaries was not easy.

【０００４】本発明は上記従来例に鑑みてなされたもの
で、複数の用途に供する辞書の容量を小さく抑え、その
保守を容易にした文書処理装置を提供することを目的と
する。The present invention has been made in view of the above-mentioned conventional example, and an object of the present invention is to provide a document processing apparatus in which the capacity of a dictionary used for a plurality of purposes is kept small and the maintenance thereof is facilitated.

【０００５】[0005]

【課題を解決するための手段】および[Means for Solving the Problems] and

【作用】上記目的を達成するために本発明の文書処理装
置は次のような構成からなる。In order to achieve the above object, the document processing apparatus of the present invention has the following configuration.

【０００６】すなわち、単語に関する複数の属性が、対
応する単語の見出しとして登録され、同一の単語に関す
る複数の属性が対応付けされている辞書と、所望の単語
の所定の属性から対応する単語の他の属性を前記辞書よ
り検索する検索手段と、該検索手段により検索された属
性に基づいて次段の処理を行なう手段とを備える。That is, a plurality of attributes related to a word are registered as headings of the corresponding word, and a dictionary in which a plurality of attributes related to the same word are associated with each other And a means for performing the next process based on the attribute retrieved by the retrieval means.

【０００７】この構成により、複数の属性を有する辞書
からそのいずれかの属性を指定して、それに対応する単
語の他の属性を引き、それを用いて次段の処理を行なう
ことができる。このため、辞書は１つで済み辞書の大き
さを縮小できる。With this configuration, it is possible to specify any one of the attributes from a dictionary having a plurality of attributes, subtract other attributes of the corresponding word, and use that to perform the next process. Therefore, only one dictionary can be used to reduce the size of the dictionary.

【０００８】更に望ましくは、前記辞書に対する保守の
指定を行なう指定手段と、前記検索手段により検索され
た単語の属性に関して、前記指定手段により指定された
様に保守する保守手段とをさらに備える。More preferably, it further comprises a designating unit for designating maintenance for the dictionary, and a maintenance unit for maintaining the attribute of the word retrieved by the retrieval unit as designated by the designating unit.

【０００９】更に望ましくは、前記指定手段は、単語の
追加か削除か属性の変更かいずれかの指定と、対象とな
る単語の属性の指定とを行い、前記保守手段は、追加の
場合には指定された属性を見出しとして前記辞書に登録
し、削除の場合には指定された属性に対応する単語に関
する属性全てを辞書より削除し、変更の場合には指定さ
れた属性を指定された値に変更する。More preferably, the designating means designates addition or deletion of a word or change of an attribute and designation of an attribute of a target word, and the maintenance means in the case of addition. The specified attribute is registered as a headline in the dictionary, in the case of deletion, all attributes related to the word corresponding to the specified attribute are deleted from the dictionary, and in the case of change, the specified attribute is set to the specified value. change.

【００１０】この構成により、１つの単語に対応する属
性をまとめて保守の対象とすることができ、保守の手間
が省け辞書の一貫性を保つことができる。With this configuration, the attributes corresponding to one word can be collectively targeted for maintenance, the maintenance work can be saved, and the consistency of the dictionary can be maintained.

【００１１】更に望ましくは、前記複数の属性は、読み
と表記と音声データとを含み、前記次段の処理は、かな
漢字変換と音声合成と音声認識を含む。More preferably, the plurality of attributes include reading, notation and voice data, and the processing in the next stage includes kana-kanji conversion, voice synthesis and voice recognition.

【００１２】この構成により、かな漢字変換と音声合成
と音声認識とを単一の辞書を用いて行なうことができ、
この辞書の保守が容易で、小容量で済む。With this configuration, kana-kanji conversion, voice synthesis and voice recognition can be performed using a single dictionary,
This dictionary is easy to maintain and requires a small capacity.

【００１３】更に望ましくは、前記複数の属性は、読み
と表記と文字認識データを含み、前記次段の処理は文字
認識とかな漢字変換とを含む。More preferably, the plurality of attributes include reading, notation and character recognition data, and the processing in the next stage includes character recognition and kana-kanji conversion.

【００１４】この構成により、文字認識とかな漢字変換
とを単一の辞書を用いて行なうことができ、この辞書の
保守が容易で、小容量で済む。With this configuration, character recognition and kana-kanji conversion can be performed using a single dictionary, which is easy to maintain and requires a small capacity.

【００１５】更に望ましくは、前記複数の属性は、読み
と表記と音声データと文字認識データとを含み、前記次
段の処理はかな漢字変換と音声合成と音声認識と文字認
識とを含む。More preferably, the plurality of attributes include reading and notation, voice data, and character recognition data, and the subsequent processing includes kana-kanji conversion, voice synthesis, voice recognition, and character recognition.

【００１６】この構成により、かな漢字変換と音声合成
と音声認識と文字認識とを単一の辞書を用いて行なうこ
とができ、この辞書の保守が容易で、小容量で済む。With this configuration, kana-kanji conversion, voice synthesis, voice recognition and character recognition can be performed using a single dictionary, and this dictionary is easy to maintain and requires a small capacity.

【００１７】[0017]

【第１実施例】次に本発明の第１の実施例である、かな
漢字変換・音声合成・音声認識機能を備える情報処理シ
ステムについて説明する。[First Embodiment] An information processing system having a kana-kanji conversion / voice synthesis / voice recognition function, which is a first embodiment of the present invention, will be described below.

【００１８】図２は、本実施例の情報処理システムの構
成を表したブロック図である。FIG. 2 is a block diagram showing the configuration of the information processing system of this embodiment.

【００１９】コード入力を行なうキーボード、座標入力
を行なうマウス、画像入力を行なう画像スキャナ、音声
入力を行なうマイクロホンなどの入力装置２１から入力
された入力データは、中央演算処理装置（ＣＰＵ）２２
によって処理され、その処理結果はＲＡＭやディスクメ
モリなどの記憶装置２４に記憶され、画像を出力するＣ
ＲＴやプリンタや、音声を出力するスピーカといった出
力装置２３から出力される。なお、入力装置２１は、音
声入力についてはそれをデジタル化する機構を備えてい
る。また、入力装置２１は、ディスク装置などの外部記
憶装置や通信ネットワークから入力するものであっても
良い。Input data input from an input device 21, such as a keyboard for code input, a mouse for coordinate input, an image scanner for image input, and a microphone for voice input, is processed by a central processing unit (CPU) 22.
Is processed by C, and the processing result is stored in a storage device 24 such as a RAM or a disk memory and outputs an image
It is output from the output device 23 such as an RT, a printer, or a speaker that outputs sound. The input device 21 has a mechanism for digitizing a voice input. The input device 21 may be an input device such as an external storage device such as a disk device or a communication network.

【００２０】図１は本実施例のシステムに備えられたモ
ジュールのブロック図である。図１のように、１つの辞
書４を、かな漢字変換部１，音声合成部２，音声認識部
３により使用する。各モジュールによる処理は記憶装置
２４に格納されたプログラムをＣＰＵ２２が実行するこ
とで実現される。また、辞書４は記憶装置２４に格納さ
れている。FIG. 1 is a block diagram of modules included in the system of this embodiment. As shown in FIG. 1, one dictionary 4 is used by the kana-kanji conversion unit 1, the voice synthesis unit 2, and the voice recognition unit 3. The processing by each module is realized by the CPU 22 executing a program stored in the storage device 24. The dictionary 4 is stored in the storage device 24.

【００２１】＜辞書の構成＞図３は辞書４の一例の全体
構成を示した図である。辞書４は、文字索引３１と音声
索引３２と読み部３３と表記部３４と音声部３５とを含
んでいる。図において、文字索引３１は、キーとなる読
みあるいは表記から、該当するレコードを検索するため
の索引であり、音声索引３２はキーとなる音声データか
ら、それに該当するレコードを検索するための索引であ
る。<Structure of Dictionary> FIG. 3 is a diagram showing the overall structure of an example of the dictionary 4. The dictionary 4 includes a character index 31, a voice index 32, a reading unit 33, a writing unit 34, and a voice unit 35. In the figure, a character index 31 is an index for retrieving a corresponding record from a key reading or notation, and a voice index 32 is an index for retrieving a corresponding record from key voice data. is there.

【００２２】読み部３３は、かなやローマ字による読み
からその表記を引くためのデータが集積された部分であ
る。そのレコード（読み部単語）は、図４（ａ）に示さ
れているとおり、見出しとなる読み列と、それに対応す
る表記を示すレコード（表記部単語）へと繋がるポイン
タと、品詞や活用形などの文法情報等とが格納されてい
る。The reading part 33 is a part in which data for subtracting the notation from the kana or romaji reading is accumulated. As shown in FIG. 4A, the record (reading part word) has a reading string as a heading, a pointer connected to the record (notation part word) indicating the corresponding notation, a part of speech, and a conjugation form. It stores grammatical information such as.

【００２３】表記部３４は、漢字などの表記からその表
記に対応する音声データを引くためのデータが集積され
た部分である。そのレコード（表記部単語）は、図４
（ｂ）に示されているとおり、見出しとなる表記列と、
それに対応する音声データが格納されたレコード（音声
部単語）へのポインタと、そのレコードの文法情報等が
格納されている。The notation section 34 is a section in which data for subtracting voice data corresponding to the notation such as Chinese characters is accumulated. The record (notation part word) is shown in FIG.
As shown in (b), a notation string as a heading,
A pointer to a record (speech word) in which the corresponding voice data is stored, grammatical information of the record, and the like are stored.

【００２４】音声部３５は、音声データからその音声デ
ータに対応する表記を引くためのデータが集積された部
分である。そのレコード（音声部単語）は、図４（ｃ）
に示されているとおり、見出しとなる音声データと、そ
れに対応する表記が格納されたレコード（表記部単語）
へのポインタと、そのレコードの文法情報等が格納され
ている。なお、音声データは、デジタル化された音声信
号で良い。The voice part 35 is a part in which data for subtracting a notation corresponding to the voice data from the voice data is accumulated. The record (speech word) is shown in FIG. 4 (c).
As shown in, the record that stores the voice data that serves as a headline and the corresponding notation (notation part word)
A pointer to and the grammatical information of the record are stored. The audio data may be a digitized audio signal.

【００２５】このように、辞書４は、登録されている項
目が同じであり、しかも対応する項目に関する属性が異
なった複数の辞書を組み合わせたものである。すなわ
ち、かな漢字変換では読みと表記との対応付けのために
辞書を用い、音声合成では表記と音声データとの対応付
けのために辞書を用いる。読み，表記，音声とはある１
つの言葉があればそれに対応して必ず存在している。そ
こで、単語を項目として持つ辞書であれば、同一の項目
すなわち単語に関して、その属性すなわち読み，表記，
音声をひとつの辞書にまとめることができる。なお、実
施例では項目としては単語を取り上げているが、それに
限るものではない。As described above, the dictionary 4 is a combination of a plurality of dictionaries having the same registered items and different attributes regarding the corresponding items. That is, in the kana-kanji conversion, a dictionary is used for associating the reading and the notation, and in speech synthesis, the dictionary is used for associating the notation and the voice data. There is reading, notation, and voice 1
If there are three words, they will always exist. Therefore, if it is a dictionary that has words as items, the attributes, that is, reading, notation,
You can combine voices into one dictionary. Although the word is taken up as an item in the embodiment, the item is not limited to this.

【００２６】＜かな漢字変換＞かな漢字変換では、読み
データから読みデータに対応する表記データを検索し、
かな漢字変換を行う。図５にその処理手順のフローチャ
ートを示す。このフローチャートは、図２の記憶装置２
４に格納されたプログラムを、ＣＰＵ２２により実行す
ることで実現される。<Kana-Kanji conversion> In Kana-Kanji conversion, the notation data corresponding to the reading data is searched from the reading data,
Kana-Kanji conversion is performed. FIG. 5 shows a flowchart of the processing procedure. This flowchart is based on the storage device 2 of FIG.
It is realized by the CPU 22 executing the program stored in 4.

【００２７】まず、ステップＳ５１で、入力装置２１よ
り読み列を入力する。読み列はローマ字やかなで入力さ
れ、読みを表わす仮名文字列として処理される。次にス
テップＳ５２でその読み列について、文節や単語の切り
出し等の文法上の解析を行なう。こうして得られた単語
について、ステップＳ５３では、その読みをキーとして
辞書４の読み部３３を検索する。同音語の処理について
は例えば各単語の使用実績に基づいて決めるなどの処理
の仕方が知られており、それらの方法に従えば良い。こ
うしてステップＳ５４で該当する読み部単語を得、ステ
ップＳ５５で、その中に格納された表記へのポインタを
たどって連結された表記部単語を得る。これに格納され
た表記列が求める読みに対応する表記であり、ステップ
Ｓ５６ではその表記列を出力する。First, in step S51, a reading string is input from the input device 21. The reading string is input in Roman characters or kana and is processed as a kana character string representing reading. Next, in step S52, the reading string is analyzed grammatically such as cutting out clauses and words. In step S53, the reading unit 33 of the dictionary 4 is searched for the word thus obtained using the reading as a key. Regarding the processing of homophones, there are known methods of processing such as determining based on the usage record of each word, and these methods may be followed. Thus, in step S54, the corresponding reading part word is obtained, and in step S55, the connected notation part word is obtained by tracing the pointer to the notation stored therein. The notation string stored therein is the notation corresponding to the desired reading, and in step S56 the notation string is output.

【００２８】＜音声合成＞音声合成では、表記データか
ら表記データに対応する音声データを検索し、音声合成
を行う。図６にその処理手順のフローチャートを示す。
このフローチャートは、図２の記憶装置２４に格納され
たプログラムを、ＣＰＵ２２により実行することで実現
される。<Speech Synthesis> In speech synthesis, speech data corresponding to the notation data is retrieved from the notation data and speech synthesis is performed. FIG. 6 shows a flowchart of the processing procedure.
This flowchart is realized by the CPU 22 executing the program stored in the storage device 24 of FIG.

【００２９】まず、ステップＳ６１で、入力装置２１よ
り表記列を入力する。表記列はかなや漢字混じりの文字
列として入力される。ステップＳ６２で、その表記列に
ついて文法上の解析を行なう。こうして得られた単語に
ついて、ステップＳ６３では、その表記をキーとして辞
書４の表記部３４を検索する。こうしてステップＳ６４
でキーの表記に該当する表記部単語を得、ステップＳ６
５で、その中に格納された音声データへのポインタをた
どって、連結された音声部単語を得る。ここに格納され
た音声データが求める表記に対応する音声データであ
り、ステップＳ６６ではその音声データを音声として出
力する。First, in step S61, a notation string is input from the input device 21. The notation string is entered as a character string containing kana and kanji. In step S62, the notation string is grammatically analyzed. In step S63, the word thus obtained is searched in the notation unit 34 of the dictionary 4 using the notation as a key. Thus, step S64
The notation word corresponding to the notation of the key is obtained at step S6
At 5, the pointer to the voice data stored therein is followed to obtain the concatenated voice part word. The voice data stored here is voice data corresponding to the desired notation, and the voice data is output as voice in step S66.

【００３０】＜音声認識＞音声認識では、音声データか
らそれに対応する表記データを検索し、音声認識を行
う。図６にその処理手順のフローチャートを示す。この
フローチャートは、図２の記憶装置２４に格納されたプ
ログラムを、ＣＰＵ２２により実行することで実現され
る。<Voice Recognition> In the voice recognition, the notation data corresponding to the voice data is searched for the voice recognition. FIG. 6 shows a flowchart of the processing procedure. This flowchart is realized by the CPU 22 executing the program stored in the storage device 24 of FIG.

【００３１】まず、ステップＳ７１で、入力装置２１よ
り音声データを入力する。音声データは、マイクロホン
などで電気信号に変換された音声信号が、更にＤ／Ａ変
換器等でデジタル化されている。次いで、ステップＳ７
２で、その音声データについて音節の切り出しなど、必
要な処理を施す。この処理については、従来知られてい
る音声認識の処理に準ずるものでよい。こうして得られ
た音節について、ステップＳ７３では、ステップＳ７２
で得られたデータをキーとして辞書４の音声部３５を検
索する。こうしてステップＳ７４でキーの音声データに
該当する音声部単語を得、ステップＳ７５で、その中に
格納された表記データへのポインタをたどって、連結さ
れた表記部単語を得る。ここに格納された表記列が求め
る音声データに対応する表記列であり、ステップＳ７６
ではその表記データを出力する。First, in step S71, voice data is input from the input device 21. As the audio data, an audio signal converted into an electric signal by a microphone or the like is further digitized by a D / A converter or the like. Then, step S7
In step 2, necessary processing such as cutting out syllables is performed on the voice data. This processing may be similar to the conventionally known speech recognition processing. Regarding the syllable thus obtained, in step S73, step S72
The voice section 35 of the dictionary 4 is searched using the data obtained in step 1 as a key. Thus, in step S74, the voice part word corresponding to the voice data of the key is obtained, and in step S75, the pointer to the notation data stored therein is traced to obtain the connected notation part word. The notation string stored here is the notation string corresponding to the desired audio data, and step S76
Then, the notation data is output.

【００３２】以上のように、本実施例のシステムでは、
１つの辞書を用いて、仮名漢字変換・音声合成・音声認
識の処理を行なうことができる。As described above, in the system of this embodiment,
It is possible to perform kana-kanji conversion / speech synthesis / speech recognition processing using one dictionary.

【００３３】＜辞書の保守＞図２４に辞書４の保守処理
をする手順の一例をフローチャートで示した。<Maintenance of dictionary> FIG. 24 is a flow chart showing an example of the procedure for the maintenance process of the dictionary 4.

【００３４】まず、オペレータが辞書の保守の開始を指
示する。ステップＳ２４１で指定された保守の種類を判
定する。保守の種類は、追加，削除，変更の３種とす
る。First, the operator gives an instruction to start maintenance of the dictionary. The type of maintenance designated in step S241 is determined. There are three types of maintenance: addition, deletion, and modification.

【００３５】削除である場合には、ステップＳ２４２で
削除対象となる単語を入力させる。単語の入力は、その
読み列を入力させれば良いステップＳ２４３では、入力
された読みから読み部単語を得、ポインタをたどって表
記部単語を、表記部単語から更に音声単語を得ることが
できる。同音語の場合には、例えば同音の表記列をすべ
て表示し、オペレータにその中から選ばせれば良い。こ
うして得られた読み部単語，表記部単語，音声部単語を
辞書から削除する。If the word is to be deleted, the word to be deleted is entered in step S242. In order to input a word, it is sufficient to input the reading string. In step S243, the reading word can be obtained from the input reading, and the pointer can be followed to obtain the notation word and further the notation word from the notation word. . In the case of homophones, for example, all the notation strings of homophones may be displayed and the operator may be selected from them. The reading word, the notation word, and the voice word thus obtained are deleted from the dictionary.

【００３６】ステップＳ２４４では、削除された単語に
対応すべく索引部を更新する。In step S244, the index portion is updated to correspond to the deleted word.

【００３７】一方、変更の場合には、ステップＳ２４５
で変更対象となる単語を指定させる。この指定は読みの
指定でよい。次にステップＳ２４６で変更対象となる部
分を指定させる。変更対象となる部分とは、読みを変更
するか、表記を変更するか、音声データを変更するかと
いったことであり、また、それらについての変更後の内
容である。On the other hand, in the case of change, step S245
Use to specify the word to be changed. This designation may be a reading designation. Next, in step S246, the portion to be changed is designated. The part to be changed is whether the reading is changed, the notation is changed, or the voice data is changed, and the contents after the change are changed.

【００３８】変更後の内容が指定されたなら、ステップ
Ｓ２４７で、対象となる単語、すなわち読み部単語か、
表記部単語か、音声部単語か、必要な単語を読み出し、
新たに指定された内容に書き換えて、読み出した単語を
書き換えられた単語に置換する。If the changed contents are designated, in step S247, the target word, that is, the reading part word,
Read notation word, voice part word, necessary word,
The newly specified content is rewritten, and the read word is replaced with the rewritten word.

【００３９】また、追加である場合には、ステップＳ２
４８で追加する単語を入力させる。追加単語は読みと、
表記と、音声とを組として入力する。その後、ステップ
Ｓ２４９で追加された単語を辞書に登録する。この際に
は、同時に読み部単語から表記部単語へのポインタ、表
記部単語から音声部単語へのポインタ、音声部単語から
表記部単語へのポインタを、各単語に書き込むことはも
ちろんである。If it is an addition, step S2
Input the word to be added at 48. The additional words are reading,
Input notation and voice as a set. Then, the word added in step S249 is registered in the dictionary. In this case, it goes without saying that a pointer from the reading word to the notation word, a pointer from the notation word to the speech word, and a pointer from the speech word to the notation word are simultaneously written in each word.

【００４０】最後に、ステップＳ２５０で索引を更新し
ておく。Finally, the index is updated in step S250.

【００４１】以上のように、所望の単語について辞書に
登録された何等かの属性（本実施例では読み）を指定
し、指定された属性から辞書を検索して、前記所望の単
語に関する登録内容（本実施例では読み部・表記部・音
声データ部）を全て追加・削除・変更の対象とすること
で、辞書の一貫性を保ち、保守の手間を減らすことがで
きる。本実施例でいえば、辞書に登録された単語の読
み，表記，音声について、一時に更新することができ
る。すなわち、かな漢字変換辞書，音声合成・認識辞書
何れかの辞書の保守は、他方の辞書の対応する単語に関
する保守ともなる。このため、複数の辞書を保守する手
間を軽減でき、また、複数の辞書間の一貫性を保つこと
ができる。As described above, some attribute (reading in this embodiment) registered in the dictionary for the desired word is designated, the dictionary is searched from the designated attribute, and the registered content relating to the desired word is specified. By adding / deleting / changing all (reading part / notation part / voice data part in this embodiment), it is possible to maintain the consistency of the dictionary and reduce the maintenance time. According to the present embodiment, the reading, notation, and voice of words registered in the dictionary can be updated at a time. That is, maintenance of either the Kana-Kanji conversion dictionary or the voice synthesis / recognition dictionary is maintenance of the corresponding word in the other dictionary. Therefore, it is possible to reduce the trouble of maintaining the plurality of dictionaries and maintain the consistency between the plurality of dictionaries.

【００４２】なお、図２４の説明では、削除・変更の場
合には。読みを入力させるようにしたが、表記あるいは
音声データを入力して、それに該当する単語を保守させ
ることもできる。この場合には読みに連結するポインタ
がないため、索引を用いて検索することななる。In the description of FIG. 24, in the case of deletion / change. Although the pronunciation is input, the notation or voice data can be input to maintain the corresponding word. In this case, since there is no pointer connected to the reading, the index will be used for the search.

【００４３】以上説明したように、本実施例の装置は、
１つの辞書を用いて仮名漢字変換・音声合成・音声認識
を行なうことができる。また、この辞書は、読み，表
記，音声それぞれを見出しとしてポインタで対応付けら
れているため、辞書中の項目に関して重複がなく、辞書
の保守の際には最小限の変更で済む。As described above, the device of this embodiment is
Kana-to-Kanji conversion, voice synthesis, and voice recognition can be performed using one dictionary. In addition, since this dictionary is associated with pointers using reading, notation, and voice as headings, there is no overlap in the items in the dictionary, and minimal changes are required when maintaining the dictionary.

【００４４】[0044]

【第２実施例】次に本発明の第２の実施例である、かな
漢字変換・文字認識機能を備える情報処理システムにつ
いて説明する。Second Embodiment Next, an information processing system having a kana-kanji conversion / character recognition function, which is a second embodiment of the present invention, will be described.

【００４５】本実施例の情報処理システムの構成は、第
１実施例と同じく図２のブロック図で示される。The configuration of the information processing system of this embodiment is shown in the block diagram of FIG. 2 as in the first embodiment.

【００４６】図８は本実施例のシステムに備えられたモ
ジュールのブロック図である。図８のように、１つの辞
書８３を、かな漢字変換部８１，文字認識部８２により
使用する。各モジュールによる処理は記憶装置２４に格
納されたプログラムをＣＰＵ２２が実行することで実現
される。また、辞書８３は記憶装置２４に格納されてい
る。FIG. 8 is a block diagram of modules included in the system of this embodiment. As shown in FIG. 8, one dictionary 83 is used by the kana-kanji conversion unit 81 and the character recognition unit 82. The processing by each module is realized by the CPU 22 executing a program stored in the storage device 24. The dictionary 83 is stored in the storage device 24.

【００４７】＜辞書の構成＞図９は辞書８３の一例の全
体構成を示した図である。辞書８３は、文字索引９１と
文字認識データ索引９２と読み部９３と表記部９４と文
字認識データ部９５とを含んでいる。図において、文字
索引９１は、キーとなる読みあるいは表記から、該当す
るレコードを検索するための索引であり、文字認識デー
タ索引９２はキーとなる文字認識データからから、それ
に該当するレコードを検索するための索引である。<Structure of Dictionary> FIG. 9 is a diagram showing the overall structure of an example of the dictionary 83. The dictionary 83 includes a character index 91, a character recognition data index 92, a reading unit 93, a notation unit 94, and a character recognition data unit 95. In the figure, a character index 91 is an index for retrieving the corresponding record from the key reading or notation, and a character recognition data index 92 is retrieving the corresponding record from the key character recognition data. It is an index for.

【００４８】読み部９３は、かなやローマ字による読み
からその表記を引くためのデータが集積された部分であ
る。そのレコード（読み部単語）は、図１０（ａ）に示
されているとおり、見出しとなる読み列と、それに対応
する表記を示すレコード（表記部単語）へと繋がるポイ
ンタと、品詞や活用形などの文法情報等とが格納されて
いる。The reading part 93 is a part in which data for subtracting the notation from the kana or romaji reading is accumulated. As shown in FIG. 10A, the record (reading part word) has a reading string that serves as a heading, a pointer connected to a record (notation part word) indicating the corresponding notation, a part of speech, and a conjugation form. It stores grammatical information such as.

【００４９】表記部９４は、漢字などの表記からその表
記に対応する音声データを引くためのデータが集積され
た部分である。そのレコード（表記部単語）は、図１０
（ｂ）に示されているとおり、見出しとなる表記列と、
そのレコードの文法情報等が格納されている。The notation section 94 is a section in which data for subtracting voice data corresponding to the notation such as Chinese characters is accumulated. The record (notation part word) is shown in FIG.
As shown in (b), a notation string as a heading,
The grammatical information of the record is stored.

【００５０】文字認識データ部９５は、文字認識データ
からそのデータに対応する表記を引くためのデータが集
積された部分である。そのレコード（文字認識データ部
単語）は、図１０（ｃ）に示されているとおり、見出し
となる文字認識データと、それに対応する表記が格納さ
れたレコード（表記部単語）へのポインタと、そのレコ
ードの文法情報等が格納されている。なお、文字認識デ
ータは、文字パターンから抽出された特徴などを表わす
データで良い。The character recognition data part 95 is a part in which data for subtracting a notation corresponding to the data from the character recognition data is accumulated. As shown in FIG. 10C, the record (character recognition data section word) has a character recognition data to be a headline and a pointer to a record (notation section word) in which the corresponding notation is stored, The grammatical information of the record is stored. The character recognition data may be data representing features extracted from the character pattern.

【００５１】＜かな漢字変換＞かな漢字変換では、読み
データから読みデータに対応する表記データを検索し、
かな漢字変換を行う。図５にその処理手順のフローチャ
ートを示す。このフローチャートは、第１実施例で説明
されているため、説明は省略する。<Kana-Kanji conversion> In Kana-Kanji conversion, the notation data corresponding to the reading data is searched from the reading data,
Kana-Kanji conversion is performed. FIG. 5 shows a flowchart of the processing procedure. Since this flowchart has been described in the first embodiment, the description will be omitted.

【００５２】＜文字認識＞文字認識では、文字認識デー
タからそれに対応する表記データを検索し、文字認識を
行う。図１１にその処理手順のフローチャートを示す。
このフローチャートは、図２の記憶装置２４に格納され
たプログラムを、ＣＰＵ２２により実行することで実現
される。<Character Recognition> In character recognition, character recognition data is searched for corresponding notation data to perform character recognition. FIG. 11 shows a flowchart of the processing procedure.
This flowchart is realized by the CPU 22 executing the program stored in the storage device 24 of FIG.

【００５３】まず、ステップＳ１１１で、画像スキャナ
などの入力装置２１より、文字画像のデータを入力す
る。次いで、ステップＳ１１２で、その文字画像データ
についての特徴の抽出など、必要な処理を施し、文字認
識データを得る。この処理については、従来知られてい
る文字認識の処理に準ずるものでよい。こうして得られ
た文字認識データについて、ステップＳ１１３では、ス
テップＳ１１２で得られたデータをキーとして辞書８３
の文字認識データ部９５を検索する。こうしてステップ
Ｓ１１４でキーの文字認識データに該当する文字認識デ
ータ部単語を得、ステップＳ１１５で、その中に格納さ
れた表記データへのポインタをたどって、連結された表
記部単語を得る。ここに格納された表記列が求める文字
認識データに対応する表記列であり、ステップＳ１１６
ではその表記データを出力する。First, in step S111, character image data is input from the input device 21 such as an image scanner. Next, in step S112, necessary processing such as extraction of characteristics of the character image data is performed to obtain character recognition data. This processing may be similar to the conventionally known character recognition processing. Regarding the character recognition data thus obtained, in step S113, the dictionary 83 is created using the data obtained in step S112 as a key.
The character recognition data section 95 is searched. Thus, in step S114, the character recognition data part word corresponding to the character recognition data of the key is obtained, and in step S115, the pointer to the notation data stored therein is traced to obtain the connected notation part word. The notation string stored here is the notation string corresponding to the desired character recognition data, and step S116
Then, the notation data is output.

【００５４】以上のような手順により、１つの辞書を用
いてかな漢字変換・文字認識を行なうことができる。な
お、辞書の保守の手順は、第１実施例の図２４で説明し
たと同じ要領で行なう。このため、１つの辞書の変更は
同時に他の辞書の変更となるため、辞書間の統一性を保
つことができ、保守の手間を省くことができる。With the above procedure, kana-kanji conversion / character recognition can be performed using one dictionary. The procedure for maintaining the dictionary is the same as that described with reference to FIG. 24 of the first embodiment. Therefore, the change of one dictionary is simultaneously changed of the other dictionaries, so that the consistency between the dictionaries can be maintained and maintenance work can be omitted.

【００５５】[0055]

【第３実施例】次に本発明の第３の実施例である、かな
漢字変換・漢字かな変換・かな漢字混じり文解析機能を
備える情報処理システムについて説明する。[Third Embodiment] Next, an information processing system having a kana-kanji conversion / kanji-kana conversion / kana-kanji mixed sentence analysis function according to a third embodiment of the present invention will be described.

【００５６】本実施例の情報処理システムの構成は、第
１実施例と同じく図２のブロック図で示される。The configuration of the information processing system of this embodiment is shown in the block diagram of FIG. 2 as in the first embodiment.

【００５７】図１２は本実施例のシステムに備えられた
モジュールのブロック図である。図１２のように、１つ
の辞書１２４を、かな漢字変換部１２１，漢字かな変換
部１２２，漢字かな混じり文解析部１２３により使用す
る。各モジュールによる処理は記憶装置２４に格納され
たプログラムをＣＰＵ２２が実行することで実現され
る。また、辞書１２４は記憶装置２４に格納されてい
る。FIG. 12 is a block diagram of modules provided in the system of this embodiment. As shown in FIG. 12, one dictionary 124 is used by the kana-kanji conversion unit 121, the kanji-kana conversion unit 122, and the kanji-kana mixed sentence analysis unit 123. The processing by each module is realized by the CPU 22 executing a program stored in the storage device 24. The dictionary 124 is stored in the storage device 24.

【００５８】＜辞書の構成＞図１２は辞書１２４の全体
構成の一例を示した図である。辞書１２４は、文字索引
部１３１と単語部１３２とから構成されている。単語部
１３２は読み単語部と表記単語部とに別れており、それ
ぞれ図１４（ａ）に示す読み単語と（ｂ）に示す表記単
語とからなる。読み単語は読み列をキーとして表記列を
引くためのものであり、表記単語は表記列をキーとして
読みを引くためのものである。各単語は、例えば読み列
あるいは表記列のコードをキーとして整列されている。<Dictionary Configuration> FIG. 12 is a diagram showing an example of the overall configuration of the dictionary 124. The dictionary 124 is composed of a character index part 131 and a word part 132. The word portion 132 is divided into a reading word portion and a notation word portion, each of which is composed of the reading word shown in FIG. 14A and the notation word shown in FIG. 14B. The reading word is for drawing a notation string with the reading string as a key, and the notation word is for drawing a reading with the notation string as a key. For example, the words are arranged by using the code in the reading column or the notation column as a key.

【００５９】それぞれの単語の内容が図１４に示されて
いる。図のように、各単語とも、読みと表記の対応の
他、文法情報を有している。この形式の辞書では、保守
に際しては、第１実施例のようにポインタをたどること
はできないが、互いに対応する読み部単語と表記部単語
とを登録しておけば、読みか表記のいずれか一方を指定
することで両方を追加・削除・変更することができる。The contents of each word are shown in FIG. As shown in the figure, each word has grammatical information in addition to the correspondence between reading and notation. In this type of dictionary, the pointer cannot be traced for maintenance as in the first embodiment, but if the reading part word and the notation part word corresponding to each other are registered, either the reading or the notation can be performed. Both can be added / deleted / changed by specifying.

【００６０】なお、辞書の構成は図１５に示した様なも
のであっても良い。図１５では、辞書は、索引１５１と
読み部１５２と表記部１５３とから構成されている。読
み部単語１６１は図１６（ａ）に示されたもので、読み
列と、それに対応する表記部単語へのポインタと、文法
情報とを有する。また、表記部単語１６２は、表記列
と、対応する読み部単語へのポインタと、文法情報とを
有する。この辞書１２４の構成は、第１実施例の辞書４
の構成から音声部を除き、表記部からのポインタを音声
部ではなく読み部としたものである。従って辞書１２４
の保守の手順も第１実施例の辞書４と同じ要領で行な
う。The structure of the dictionary may be as shown in FIG. In FIG. 15, the dictionary includes an index 151, a reading unit 152, and a writing unit 153. The reading part word 161 is shown in FIG. 16A, and has a reading string, a pointer to the corresponding notation part word, and grammar information. The notation word 162 has a notation string, a pointer to the corresponding reading word, and grammar information. The structure of this dictionary 124 is the same as the dictionary 4 of the first embodiment.
The voice part is removed from the configuration of the above, and the pointer from the notation part is not the voice part but the reading part. Therefore, the dictionary 124
The maintenance procedure is also performed in the same manner as the dictionary 4 of the first embodiment.

【００６１】更に、辞書１２４の構成は、図１７に示し
たものであっても良い。図１７の辞書は、辞書に登録さ
れた読み列のコードと表記のコードとが異なる例であ
る。例えば、読み列あるいは単語列をＪＩＳコードに従
って作成する場合には、１文字に１バイトが必要とされ
る。ところが、読み列に限っていえば、それを仮名で表
記すれば１文字あたり１バイトで十分である。そこで、
読み列を１バイトコードで、表記列を２バイトコードで
表わすようにすれば辞書を小さくすることができる。読
み部単語および表記部単語はそれぞれ図１８（ａ），
（ｂ）に示されている。構成自体は図１６と同じだが、
読み列はコードＡで、表記列は別のコードＢで記述され
ている。Furthermore, the structure of the dictionary 124 may be that shown in FIG. The dictionary of FIG. 17 is an example in which the reading column code and the notation code registered in the dictionary are different. For example, when a reading string or a word string is created according to the JIS code, one character requires 1 byte. However, if it is limited to the reading string, one byte for each character is sufficient if it is written in kana. Therefore,
The dictionary can be made smaller by representing the reading string with a 1-byte code and the notation string with a 2-byte code. The reading part word and the notation part word are respectively shown in FIG.
It is shown in (b). The configuration itself is the same as in Fig. 16, but
The reading string is described by the code A, and the notation string is described by another code B.

【００６２】＜かな漢字変換＞かな漢字変換では、読み
データから読みデータに対応する表記データを検索し、
かな漢字変換を行う。図５にその処理手順のフローチャ
ートを示す。このフローチャートは、第１実施例で説明
されているため、説明は省略する。ただし、図１３の辞
書を用いる場合には、ポインタをたどる必要はなく、読
みに対応する表示は同じレコードに格納されている。<Kana-Kanji conversion> In Kana-Kanji conversion, the notation data corresponding to the reading data is searched from the reading data,
Kana-Kanji conversion is performed. FIG. 5 shows a flowchart of the processing procedure. Since this flowchart has been described in the first embodiment, the description will be omitted. However, when the dictionary of FIG. 13 is used, it is not necessary to follow the pointer, and the display corresponding to reading is stored in the same record.

【００６３】＜漢字かな変換＞漢字かな変換は、漢字混
じりの文を、例えば仮名で表記された読み列に変換する
ものである。図１９にその処理手順のフローチャートを
示した。<Kanji-kana conversion> The kanji-kana conversion is for converting a sentence containing kanji into a reading string written in, for example, kana. FIG. 19 shows a flowchart of the processing procedure.

【００６４】まず、ステップＳ１９１で、入力装置２１
より表記列を入力する。表記列は漢字混じりの文で入力
される。次に、ステップＳ１９２で、その表記列につい
て、単語の切り出し等の解析を行なう。こうして得られ
た単語について、ステップＳ１９３では、その表記列の
コードをキーとして辞書１２４を検索する。こうしてス
テップＳ１９４で該当する単語を得、ステップＳ１９５
で、その中に格納された読み列、あるいはポインタをた
どって連結された読み列を得る。この読み列が求める表
記に対応する読みであり、ステップＳ１９６ではその読
み列を出力する。First, in step S191, the input device 21
Enter the notation sequence. The notation string is entered as a sentence containing mixed kanji. Next, in step S192, the notation string is analyzed such as cutting out words. In step S193, the dictionary 124 is searched for the words thus obtained using the code of the notation string as a key. Thus, the corresponding word is obtained in step S194, and step S195
Then, the reading string stored therein or the reading string connected by tracing the pointer is obtained. This reading string is a reading corresponding to the notation required, and the reading string is output in step S196.

【００６５】＜漢字かな文解析＞漢字かな混じり文解析
は、表記データ、あるいは読みデータから、それに対応
する文法情報を得、文を解析するものである。図２０に
その処理手順のフローチャートを示した。<Kanji / Kana Sentence Analysis> The Kanji / Kana mixed sentence analysis obtains grammatical information corresponding to the written data or reading data and analyzes the sentence. FIG. 20 shows a flowchart of the processing procedure.

【００６６】まず、ステップＳ２０１で、入力装置２１
より表記列を入力する。表記列は漢字混じりの文で入力
される。次に、ステップＳ２０２で、表記列によって辞
書を検索する。かなによる単語が対象であれば、読み列
から辞書を検索しても良い。こうしてステップＳ２０３
で検索対象の単語列あるいは読み列に該当する単語を取
得し、ステップＳ２０３で取得した単語に格納された文
法情報を得ることができる。この文法情報を用いて、ス
テップＳ２０５で入力された表記列を解析する。First, in step S201, the input device 21
Enter the notation sequence. The notation string is entered as a sentence containing mixed kanji. Next, in step S202, the dictionary is searched by the notation string. If the word of Kana is the target, the dictionary may be searched from the reading string. Thus, step S203
The word corresponding to the word string or the reading string to be searched can be acquired with, and the grammatical information stored in the word acquired in step S203 can be obtained. Using this grammar information, the notation string input in step S205 is analyzed.

【００６７】以上のような手順により、１つの辞書を用
いてかな漢字変換・漢字かな変換・漢字かな文解析を行
なうことができる。なお、辞書の保守の手順は、第１実
施例の図２４で説明したと同様の要領で行なう。このた
め、１つの辞書の変更は同時に他の辞書の変更となり、
辞書間の統一性を保つことができ、保守の手間を省くこ
とができる。With the above procedure, kana-kanji conversion, kanji-kana conversion, and kanji-kana sentence analysis can be performed using one dictionary. The procedure for maintaining the dictionary is the same as that described with reference to FIG. 24 of the first embodiment. Therefore, changes in one dictionary will be changes in the other dictionary at the same time,
Uniformity between dictionaries can be maintained and maintenance work can be omitted.

【００６８】[0068]

【第４実施例】次に本発明の第４の実施例である、かな
漢字変換・音声合成・音声認識・文字認識機能を備える
情報処理システムについて説明する。Fourth Embodiment Next, an information processing system having a kana-kanji conversion / voice synthesis / voice recognition / character recognition function, which is a fourth embodiment of the present invention, will be described.

【００６９】本実施例の情報処理システムの構成は、第
１実施例と同じく図２のブロック図で示される。The configuration of the information processing system of this embodiment is shown in the block diagram of FIG. 2 as in the first embodiment.

【００７０】図２１は本実施例のシステムに備えられた
モジュールのブロック図である。図２１のように、１つ
の辞書２１５を、かな漢字変換部２１１，音声合成部２
１２，音声認識部２１３，文字認識部２１４により使用
する。各モジュールによる処理は記憶装置２４に格納さ
れたプログラムをＣＰＵ２２が実行することで実現され
る。また、辞書２１５は記憶装置２４に格納されてい
る。FIG. 21 is a block diagram of modules included in the system of this embodiment. As shown in FIG. 21, one dictionary 215 is used as a kana-kanji conversion unit 211 and a voice synthesis unit 2.
12, used by the voice recognition unit 213 and the character recognition unit 214. The processing by each module is realized by the CPU 22 executing a program stored in the storage device 24. The dictionary 215 is also stored in the storage device 24.

【００７１】＜辞書の構成＞図２２は辞書２１５の全体
構成の一例を示した図である。辞書２１５は、文字索引
部２２５と音声索引部２２６と読み部２２１と表記部２
２２と音声部２２３と文字認識データ部２２４とから構
成されている。読み部２２１は仮名などで表記された読
みを見出しとするレコード（単語）が格納されており、
読みと表記とを対応づけている。表記部２２２には漢字
などの表記を見出しとするレコードが格納されており、
表記と音声と対応づけている。音声部２２３には音声デ
ータを見出しとするレコードが格納されており、これを
表記と対応づけている。文字認識データ部単２２４には
文字認識データが格納されており、これと表記とを対応
づけている。<Structure of Dictionary> FIG. 22 shows an example of the overall structure of the dictionary 215. The dictionary 215 includes a character index unit 225, a voice index unit 226, a reading unit 221, and a writing unit 2.
22 and a voice unit 223 and a character recognition data unit 224. The reading unit 221 stores a record (word) whose heading is reading written in kana or the like,
The reading and the notation are associated with each other. The notation section 222 stores records having headings for notations such as Chinese characters.
Corresponds to notation and voice. The voice section 223 stores a record having voice data as a headline, which is associated with the notation. The character recognition data unit 224 stores character recognition data, which is associated with the notation.

【００７２】それぞれの単語の内容が図２３（ａ）〜
（ｄ）に示されている。各単語には見出しと、それに対
応する単語へのポインタと、文法情報とが格納されてい
る。この辞書２１５を用いたかな漢字変換・音声合成・
音声認識・文字認識処理については、第１実施例および
第２実施例で説明したと同じ手順であり、その説明は省
略するが、１つの辞書を用いてかな漢字変換・音声合成
・音声認識・文字認識を行なうことができる。なお、辞
書の保守の手順は、第１実施例の図２４で説明したと同
様の要領で行なう。このため、１つの辞書の変更は同時
に他の辞書の変更となり、辞書間の統一性を保つことが
でき、保守の手間を省くことができる。The contents of each word are shown in FIG.
It is shown in (d). Each word stores a headline, a pointer to the corresponding word, and grammar information. Kana-Kanji conversion / speech synthesis using this dictionary 215
The voice recognition / character recognition processing is the same procedure as described in the first and second embodiments, and the description thereof will be omitted, but using one dictionary, kana-kanji conversion / voice synthesis / voice recognition / character Can recognize. The procedure for maintaining the dictionary is the same as that described with reference to FIG. 24 of the first embodiment. For this reason, the change of one dictionary becomes the change of the other dictionary at the same time, and the consistency between the dictionaries can be maintained, and the trouble of maintenance can be saved.

【００７３】このように、第１〜第４実施例と同じ要領
で種々の辞書を一つにまとめてそれを使用することや、
それを保守することができ、それによって辞書の統一性
を保持し、辞書の保守を簡単にし、辞書の大きさを縮小
することができる。また、単語に限らず、同じ項目が登
録された尚、本発明は、複数の機器から構成されるシス
テムに適用しても、１つの機器から成る装置に適用して
も良い。また、本発明はシステム或は装置にプログラム
を供給することによって達成される場合にも適用できる
ことは言うまでもない。As described above, various dictionaries are combined into one and used in the same manner as in the first to fourth embodiments.
It can be maintained, thereby maintaining the integrity of the dictionary, simplifying the maintenance of the dictionary and reducing the size of the dictionary. Further, not only words but the same items are registered. The present invention may be applied to a system including a plurality of devices or an apparatus including a single device. Further, it goes without saying that the present invention can be applied to the case where it is achieved by supplying a program to a system or an apparatus.

【００７４】[0074]

【発明の効果】以上説明したように、本発明に係る文書
処理装置は、複数の用途に供する辞書の容量を小さく抑
え、その内容の統一性を保持し、保守を容易にすること
ができるという効果がある。As described above, the document processing apparatus according to the present invention can reduce the capacity of the dictionary used for a plurality of purposes, maintain the uniformity of the contents, and facilitate the maintenance. effective.

【００７５】[0075]

[Brief description of drawings]

【図１】第１実施例の文書処理システムの構成を示すブ
ロック図である。FIG. 1 is a block diagram illustrating a configuration of a document processing system according to a first embodiment.

【図２】本発明の実施例である情報処理システムの構成
を表すブロック図である。FIG. 2 is a block diagram showing a configuration of an information processing system that is an embodiment of the present invention.

【図３】第１実施例の辞書の全体構成図である。FIG. 3 is an overall configuration diagram of a dictionary according to a first embodiment.

【図４】第１実施例の辞書の単語の構成を示す図であ
る。FIG. 4 is a diagram showing a structure of words in a dictionary according to the first embodiment.

【図５】かな漢字変換処理手順のフローチャートであ
る。FIG. 5 is a flowchart of a kana-kanji conversion processing procedure.

【図６】音声合成処理手順のフローチャートである。FIG. 6 is a flowchart of a voice synthesis processing procedure.

【図７】音声認識処理手順のフローチャートである。FIG. 7 is a flowchart of a voice recognition processing procedure.

【図８】第２実施例の文書処理システムの構成を示すブ
ロック図である。FIG. 8 is a block diagram showing a configuration of a document processing system according to a second exemplary embodiment.

【図９】第２実施例の辞書の全体構成図である。FIG. 9 is an overall configuration diagram of a dictionary according to a second embodiment.

【図１０】第２実施例の辞書の単語の構成を示す図であ
る。FIG. 10 is a diagram showing a structure of words in a dictionary according to a second embodiment.

【図１１】文字認識処理手順のフローチャートである。FIG. 11 is a flowchart of a character recognition processing procedure.

【図１２】第３実施例の文書処理システムの構成を示す
ブロック図である。FIG. 12 is a block diagram showing a configuration of a document processing system of a third exemplary embodiment.

【図１３】第３実施例の辞書の全体構成図である。FIG. 13 is an overall configuration diagram of a dictionary according to a third embodiment.

【図１４】第３実施例の辞書の単語の構成を示す図であ
る。FIG. 14 is a diagram showing a structure of words in a dictionary according to a third embodiment.

【図１５】第３実施例の他の構成の辞書の全体構成図で
ある。FIG. 15 is an overall configuration diagram of a dictionary having another configuration according to the third embodiment.

【図１６】図１５に示された辞書の単語の構成を示す図
である。16 is a diagram showing a structure of words in the dictionary shown in FIG. 15. FIG.

【図１７】第３実施例の他の構成の辞書の全体構成図で
ある。FIG. 17 is an overall configuration diagram of a dictionary having another configuration according to the third embodiment.

【図１８】図１７に示された辞書の単語の構成を示す図
である。FIG. 18 is a diagram showing a structure of words in the dictionary shown in FIG. 17;

【図１９】漢字かな変換処理手順のフローチャートであ
る。FIG. 19 is a flowchart of a kanji-kana conversion processing procedure.

【図２０】漢字かな文解析処理手順のフローチャートで
ある。FIG. 20 is a flowchart of a kanji / kana sentence analysis processing procedure.

【図２１】第４実施例の文書処理システムの構成を示す
ブロック図である。FIG. 21 is a block diagram showing the configuration of a document processing system according to a fourth embodiment.

【図２２】第４実施例の辞書の全体構成図である。FIG. 22 is an overall configuration diagram of a dictionary according to a fourth embodiment.

【図２３】第４実施例の辞書の単語の構成を示す図であ
る。FIG. 23 is a diagram showing a structure of words in a dictionary according to a fourth embodiment.

【図２４】辞書の保守の処理手順を示すフローチャート
である。１かな漢字変換部、２音声合成部、３音声認識部、４辞書、２１入力装置、２２中央演算処理装置、２３出力装置、２４記憶装置である。FIG. 24 is a flowchart showing a processing procedure of dictionary maintenance. 1 Kana-Kanji conversion unit, 2 voice synthesis unit, 3 voice recognition unit, 4 dictionary, 21 input device, 22 central processing unit, 23 output device, 24 storage device.

Claims

[Claims]

1. A dictionary in which a plurality of attributes related to a word are registered as headings of corresponding words, and a plurality of attributes related to the same word are associated with each other, and other words corresponding to predetermined attributes of a desired word A document processing apparatus comprising: a search unit that searches the attribute of the item from the dictionary; and a unit that performs a subsequent process based on the attribute searched by the search unit.

2. A designating unit for designating maintenance for the dictionary, and a maintenance unit for maintaining the attribute of the word retrieved by the retrieval unit as designated by the designating unit.
The document processing apparatus according to claim 1, further comprising:

3. The designating means designates either addition or deletion of a word or change of attribute, and designation of an attribute of a target word, and the maintenance means is designated in case of addition. Registered attributes as headings in the dictionary, in the case of deletion, all attributes related to the word corresponding to the specified attribute are deleted from the dictionary, and in case of change, the specified attribute is changed to the specified value. The document processing apparatus according to claim 2, wherein:

4. The plurality of attributes include reading, notation, and voice data, and the processing in the next stage includes kana-kanji conversion, voice synthesis, and voice recognition. The document processing device according to any one of the above.

5. The method according to claim 1, wherein the plurality of attributes include reading, notation, and character recognition data, and the processing in the next stage includes character recognition and kana-kanji conversion. Document processing device described.

6. The plurality of attributes include reading and notation, voice data, and character recognition data, and the processing in the next stage includes kana-kanji conversion, voice synthesis, voice recognition, and character recognition. The document processing apparatus according to claim 1.