JPH0424885A

JPH0424885A - Reading processor

Info

Publication number: JPH0424885A
Application number: JP2130874A
Authority: JP
Inventors: Masahisa Yano; 矢野　雅久; Yoshiyuki Yamashita; 山下　義征
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-05-21
Filing date: 1990-05-21
Publication date: 1992-01-28

Abstract

PURPOSE:To improve reading environment by extracting a color character picture corresponding to a color characteristic as the object of reading from a medium on which a color character is recorded, and outputting it to a character recognizing part. CONSTITUTION:In the case of the medium designated for the reading, a picture input part 1 outputs its picture signal to a color characteristic storage part 3, and the color characteristic storage part measures and stores the color characteristic, and in the case of the document of the object of the reading, the picture input part 1 outputs its picture signal to a picture extracting part 4, and the picture extracting part 4 extracts the character picture corresponding to the color characteristic or the character picture in a color mark area as the object of the reading on the basis of the color characteristic stored in the color characteristic storage part 3, and outputs it to the character recognizing part 5. Then, the character recognizing part 5 executes the inclination correction, etc., of the extracted document picture, and segments the charac ter picture by every one character, and recognizes and converts it into a character code, and a voice synthesizing part 6 converts this character code into voice waveform, and a voice is outputted from a voice output part 7. Thus, the reading environment is improved.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は文書、書籍上の文字を認識結果に基づいて朗読
音声を出力する読書処理装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a reading processing device that outputs a reading voice based on the recognition result of characters on a document or book.

[Prior art]

従来この種の読書処理装置としては、電子情報通信学会
技術研究報告　信学技報Ｖｏ１．８７Ｎｏ、３３４　１
９８８年１月２２日発行　Ｐ７９〜８６に開示されたも
のがあった。第２図は上記従来のこの種の読書処理装置
の合成を示すブロック図である。図示するように読書処
理装置は、画像入力部２１、画像処理部２２、文字認識
部２３、音声合成部２４及び音声出力部２５を具備する
構成である。Conventionally, this type of reading processing device was published in IEICE Technical Report Vol. 1.87 No. 334 1.
Published on January 22, 1988, there was something disclosed on pages 79 to 86. FIG. 2 is a block diagram showing the synthesis of the above-mentioned conventional reading processing device of this type. As shown in the figure, the reading processing device includes an image input section 21, an image processing section 22, a character recognition section 23, a voice synthesis section 24, and a voice output section 25.

画像入力部２１で、文書、書籍をページ単位で入力し、
この画像に対して、画像処理部２２で傾き補正等の画像
処理を行ない、文字認識部２３で出力された文書画像文
字の切出し、認識を行ない最終的な出力として音声出力
を行なうものであった。The image input unit 21 inputs documents and books page by page,
An image processing section 22 performs image processing such as tilt correction on this image, a character recognition section 23 cuts out and recognizes the output document image characters, and outputs audio as the final output. .

[Problem to be solved by the invention]

しかしながら上記構成の従来の読書処理装置では、健常
者が文書、書籍を読む時、日常的に行なっている下記の
ような読み方の選択が不可能であった。また、盲人や肢
体障害者などの障害者にとって良い読書環境であるとは
言えなかった。However, with the conventional reading processing device having the above-mentioned configuration, it has been impossible for healthy people to select the reading method described below, which they do on a daily basis when reading documents and books. In addition, it could not be said that it was a good reading environment for people with disabilities such as blind people or physically disabled people.

例えば、読書対象の文書、書籍が論文などの場合、（１）題名や執筆者などの書誌的事項を読む（２）要約
を読む（３）章２節等の題名を読む（４〉参考文献を読む（５）上記を知った上で全文を読むか読まないかを判断
する。For example, if the document or book you are reading is an essay, (1) Read the bibliographic information such as the title and author, (2) Read the summary, (3) Read the title of chapter 2, etc., (4) References. (5) Knowing the above, decide whether to read the entire text or not.

本発明は上述の点に鑑みてなされたもので上記問題点を
除去するため、文書、書籍のカラーマーク部分の文章を
自由に選択して読み取ることができるようにし、晴眼者
が日常的に行なっている文書書籍の拾い読みに近い形の
読み方が出来、障害者の読書環境の向上に役立つ読書処
理装置を提供することを目的とする。The present invention has been made in view of the above-mentioned problems, and in order to eliminate the above-mentioned problems, it is possible to freely select and read the text in the color mark part of documents and books, so that sighted people can read it on a daily basis. To provide a reading processing device capable of reading in a manner similar to the browsing method of reading documents and books, which is useful for improving the reading environment for handicapped people.

[Means to solve the problem]

上記課題を解決するため本発明は読書処理装置を下記の
ように構成した。In order to solve the above problems, the present invention has configured a reading processing device as follows.

媒体上を走査し画像信号を出力する画像入力部と、読書指定媒体上の色特性を測定して記憶する色特性記憶
部と、カラー文字が記載された媒体から、色特性に対応するカ
ラー文字画像を読み上げ対象として抽出し文字認識部に
出力する画像抽出部を設けたことを特徴とする。An image input section that scans the medium and outputs an image signal, a color characteristic storage section that measures and stores the color characteristics on the designated reading medium, and a color character corresponding to the color characteristics from the medium on which the color characters are written. The present invention is characterized in that it includes an image extraction unit that extracts an image as a reading target and outputs it to a character recognition unit.

また、媒体はカラーマークが追記又は印刷されたもので
あって、画像抽出部は前記色特性に基づいて、カラーマ
ークを検出し、当該カラーマーク領域内の文字画像を読
み上げ対象として抽出することを特徴とする。Further, the medium is one on which a color mark is added or printed, and the image extraction unit detects the color mark based on the color characteristics and extracts the character image within the color mark area as a reading target. Features.

また、読書指定媒体は読書対象種別を表す、慨字が記載
されたカードとしたことを特徴とする。Further, the reading designation medium is characterized by being a card on which a letter indicating the type of reading object is written.

[Effect]

本発明の読書処理装置は上記の如く構成するので、読書
指定媒体の場合には画像入力部はその画像信号を色特性
記憶部に出力し、色特性記憶部は色特性を測定し記憶す
る。読書対象文書の場合には画像入力部はその画像信号
を画像抽出部に出力し、画像抽出部は色特性記憶部に記
憶された色特性を基に、該色特性に対応するカラー文字
画像又はカラーマーク領域内の文字画像を読み上げ対象
として抽出し文字認識部に出力する。文字認識部は抽出
された文書画像に傾き補正等の処理を施し、１文字毎、
文字画像を切出し、認識し、文字コードに変換し、音声
合成部はこの文字コードを音声波形に変換し、音声出力
部から音声が出力される。Since the reading processing device of the present invention is configured as described above, in the case of a medium designated for reading, the image input section outputs the image signal to the color characteristic storage section, and the color characteristic storage section measures and stores the color characteristics. In the case of a document to be read, the image input section outputs the image signal to the image extraction section, and the image extraction section extracts a color character image or a color character image corresponding to the color characteristics based on the color characteristics stored in the color characteristics storage section. Character images within the color mark area are extracted as objects to be read out and output to the character recognition unit. The character recognition unit performs processing such as tilt correction on the extracted document image, and
A character image is cut out, recognized, and converted into a character code, and a speech synthesis section converts this character code into an audio waveform, and audio is output from an audio output section.

即ち、本発明の読書処理装置は健常者が文書書籍の拾い
読みを行なう手順で文書書籍の所望の部分に重要度に合
わせたカラーマークを追記又は印刷しておき、カラーマ
ーク部分の文章を自由に選択して読み上げるので、晴眼
者が日常的に行なっている文書書籍の拾い読みに極力近
い形の読み取りができる。That is, the reading processing device of the present invention adds or prints a color mark according to the importance to a desired part of a document book in the process of browsing the document book by a healthy person, and then freely edits the text in the color mark portion. Since the text is selected and read aloud, it is possible to read in a manner as close as possible to the browsing of documents and books that sighted people do on a daily basis.

読書指定媒体は読書対象種別を表す点字が記載きれたカ
ードとすることにより、盲人等の障害者の読書環境の向
上にも役立つ。By using a card with Braille characters indicating the type of reading target as the designated reading medium, it is also useful for improving the reading environment for people with disabilities such as the blind.

〔Example〕

以下、本発明の一実施例を図面に基づいて説明する。 Hereinafter, one embodiment of the present invention will be described based on the drawings.

第１図は本発明の実施例である読書処理装置の構成を示
すブロック図である。図示するように、本読書処理装置
は、画像入力部１、読取モード切換部２、色特性記憶部
３、画像抽出部４、文字認識部５、音声合成部６及び音
声出力部７を具備する構成である。FIG. 1 is a block diagram showing the configuration of a reading processing device that is an embodiment of the present invention. As shown in the figure, the reading processing device includes an image input section 1, a reading mode switching section 2, a color characteristic storage section 3, an image extraction section 4, a character recognition section 5, a speech synthesis section 6, and a speech output section 7. It is the composition.

画像入力部１は、第４図、第５図に示すような用紙を主
走査を水平方向（ＸＸ）、副走査を垂直方向（ＹＹ）と
して走査し、用紙上に記載されているマーク及び／又は
文書９図面の濃淡及び色の一方又は双方に応じた画像信
号、例えば濃度多値レベル信号又はＲ（赤）、Ｇ（緑）
、Ｂ（青）の多値レベル信号を読取モード切換部２へ出
力する。The image input unit 1 scans a sheet of paper as shown in FIGS. 4 and 5 with main scanning in the horizontal direction (XX) and sub-scanning in the vertical direction (YY), and records marks and/or marks written on the paper. Or an image signal corresponding to one or both of the shading and color of the document 9 drawing, such as a density multilevel signal or R (red), G (green)
, B (blue) multilevel signals are output to the reading mode switching section 2.

読取モード切換部２は、図示しないスイッチ等を用いて
前記画像入力部１からの画像信号の出力光を色特性記憶
部３又は画像抽出部４へと切換える。読取モード切換部
２は、第４図に示すような読書カードの場合には画像入
力部１からの画像信号は色特性記憶部３へ出力し、第５
図に示すような読書対象文書の場合には画像抽出部４へ
出力する。The reading mode switching section 2 switches the output light of the image signal from the image input section 1 to the color characteristic storage section 3 or the image extraction section 4 using a switch or the like (not shown). In the case of a reading card as shown in FIG. 4, the reading mode switching unit 2 outputs the image signal from the image input unit 1 to the color characteristic storage unit 3,
In the case of a document to be read as shown in the figure, it is output to the image extraction section 4.

色特性記憶部３は、第４図に示すような読書カードの所
定の領域（ｍｓ　）の色特性を測定し、記憶する。The color characteristic storage section 3 measures and stores the color characteristics of a predetermined area (ms) of the reading card as shown in FIG.

画像抽出部４は、前記色特性に基づいて第５図に示すよ
うな文書、書籍上に記載されているカラーマーク領域内
の文書画像を読み上げ対象として抽出する。The image extraction unit 4 extracts a document image within a color mark area written on a document or book as shown in FIG. 5 as a reading target based on the color characteristics.

文字認識部５は、抽出された文書画像に傾き補正等の処
理を施し、１文字毎、文字画像を切出し、認識し、文字
コードに変換する。The character recognition unit 5 performs processing such as tilt correction on the extracted document image, cuts out the character image for each character, recognizes it, and converts it into a character code.

文字認識部５は、前記文字コードを音声波形に変換し、
音声出力部７に出力し、該音声出力部７から音声が出力
される。The character recognition unit 5 converts the character code into a voice waveform,
The signal is output to the audio output section 7, and the audio is output from the audio output section 7.

第３図は読書カード及び読書対象文書マークを色毎に分
類するための原理を示す色特性の説明図であって、同図
（ａ）はマークの色座標の範囲を示す色度図、同図（ｂ
）はマークの濃度範囲の説明図である。FIG. 3 is an explanatory diagram of color characteristics showing the principle for classifying reading cards and document marks to be read by color; FIG. Figure (b
) is an explanatory diagram of the density range of the mark.

第３図（ａ）に示す原理を用いる場合、画像入力部１に
は、例えばＲＧＢ系の濃度多値レベル信号を出力できる
カラースキャナが必要である。When using the principle shown in FIG. 3(a), the image input section 1 requires a color scanner capable of outputting, for example, an RGB density multilevel signal.

第３図（ｂ）に示す原理を用いる場合、画像入力部１は
濃度多値レベル信号を出力できるモノクロスキャナ又は
上記カラースキャナが必要である。When using the principle shown in FIG. 3(b), the image input section 1 requires a monochrome scanner or the above-mentioned color scanner capable of outputting a multilevel density signal.

第３図（ａ）の色度図は公知の方法により作成したもの
を概略的に示したものであって、ＲＧＢ系の濃度信号を
座標変換式〈１）を用いてＸＹＺ系に変換し、Ｘ　＝　２．７６８９　Ｒ＋　１．７５１７　Ｇ　＋　
１．１３０２　ＢＹ＝　　　　　Ｒ＋４．５９０７Ｇ＋
０．０６０１ＢＺ＝　　　　　　０．０５６５Ｇ＋５．
５９４３Ｂ　　　　（１）求められた、ｘ、ｙ、ｚをも
とに式（２）を用いて色度座標ｘ　ｒ　ｙを計算し、ｘ＝Ｘ／（Ｘ十Ｙ＋Ｚ）ｙ＝ｙ／（ｘ＋ｙ−＋−ｚ）　　　　　　　　　（２）
求められた色座標Ｘ　Ｔ　３’による直交座標を用いた
ものである。前記ｘ、ｙは色相と彩度を表すものであり
、Ｙは明度を表わすものである。The chromaticity diagram in FIG. 3(a) is a schematic diagram created by a known method, in which the RGB density signal is converted to the XYZ system using the coordinate conversion formula <1), X = 2.7689 R+ 1.7517 G+
1.1302 BY=R+4.5907G+
0.0601BZ=0.0565G+5.
5943B (1) Based on the obtained x, y, and z, calculate the chromaticity coordinates x r y using equation (2), and x = X / (X + Y + Z) y = y / (x + y - +-z) (2)
This uses orthogonal coordinates based on the obtained color coordinates X T 3'. The x and y represent hue and saturation, and Y represents lightness.

上述のＲＧＢ系からＸＹＺ系への変換〔式（１）を用い
る〕色相と彩度を表わすＸ、７を求める計算〔式（２）
を用いる〕はそれぞれＲ，Ｇ、Ｂを入力し、ｘ、ｙ、ｚ
を出力するＲＯＭを用いて、及びｘ、ｙ、ｚを入力し、
ｘ、ｙを整数形式で出力するＲＯＭを用いて変換するこ
とによって、計算時間が不要となり、処理の高速化がは
かれる。Conversion from the above RGB system to the XYZ system [using formula (1)] Calculation to obtain X, 7 representing hue and saturation [formula (2)]
], input R, G, B, x, y, z
using a ROM that outputs and inputs x, y, z,
By converting using a ROM that outputs x and y in integer format, calculation time is not required and processing speed can be increased.

第３図（ｂ）の場合、領域抽出対象の文書、図面が白、
黒で表現されたものであって、原稿上のマークの濃度レ
ベルが前記白、黒の濃度と重ならない濃度レベルであっ
て、前記マークの色が色毎に互いに重ならない濃度レベ
ルを持つ色を選択することによって色の分類が可能とな
る。また、前記Ｙを同様に表現してもよい。In the case of Fig. 3(b), the document or drawing to be extracted is white;
A color that is expressed in black, the density level of the mark on the document is a density level that does not overlap with the density of the white and black, and the color of the mark has a density level that does not overlap with each other for each color. Color classification is possible by selection. Moreover, the above Y may be expressed similarly.

第１図の色特性記憶部３に記憶する色特性は前述の原理
に限定されるものではなく、他の公知の色特性に変換し
たものを用いてもよい力松本発明では説明簡略化のため
に第３図（ａ）に示す色度図を用いて記憶されるものと
する。The color characteristics stored in the color characteristics storage section 3 in FIG. It is assumed that the chromaticity diagram is stored using the chromaticity diagram shown in FIG. 3(a).

第４図は読書カードの１例であって、Ｍｌは書籍種別を
示す目視可能な文字列、Ｍ２は読み上げ対象領域に付与
した名称を示す目視可能な文字列、ＴＩ、Ｔ２はそれぞ
れ文字列Ｍｌ、Ｍ２に対応した点字列、ｍｓは文字列Ｍ
ｌ　、Ｍ２に対応して、文書、書籍の読み上げ対象領域
に追記または印刷されているカラーマークと同じ色特性
を持つカラーマークである。FIG. 4 is an example of a reading card, where Ml is a visible character string indicating the type of book, M2 is a visible character string indicating the name given to the area to be read out, and TI and T2 are each a character string Ml. , the Braille string corresponding to M2, ms is the character string M
This is a color mark that has the same color characteristics as a color mark that is added or printed in the reading target area of a document or book, corresponding to M2.

読書カードは第４図に示すように構成されているので、
健常者、視覚障害者いずれにも簡単に操作可能であり装
置使用者が任意に作成することも可能である。The reading card is structured as shown in Figure 4, so
It can be easily operated by both healthy people and visually impaired people, and can be created arbitrarily by the user of the device.

また、書籍種別を前記論文の他の特許、文庫本、雑誌、
社内文書、新聞等とし、読み上げ対象領域は前記書籍種
別毎に最適な名前を付与した読書カードを用意しておく
ことによって読書対象をも広げることが可能となる。In addition, the book type can be changed to other patents, paperbacks, magazines, etc.
In-house documents, newspapers, etc. can be read aloud, and by preparing a reading card with an optimal name assigned to each book type, the range of reading objects can be expanded.

なお、以下の説明において、読書カードは論文を対象と
し、領域名称は「要約」、「書誌的事項」、１章１節の
題目」、「参考文献」、カラーマークの色は各々緑、赤
、青、黄とする。In the following explanation, reading cards are for papers, and the area names are "Abstract", "Bibliographical matters", "Title of chapter 1, section 1", "References", and the color marks are green and red, respectively. , blue, and yellow.

次に、色特性記憶部３の動作について説明する。色特性
記憶部３はＲＧＢ系の画像信号から得られる色度座標ｘ
、ｙを整数化、例えば１００倍して整数化したＸ％、ｙ
％をアドレスとする２次元メモリ（カラーマツプ）を持
ち、第４図に示すような読書指定カードの読み込みが開
始されると、前記カラーマツプと全文読取フラグを１０
．に初期化する。続いて走査が所定の領域（ｍ　ｓ　）
に達すると入力きれるＲＧＢ系の画像信号を整数化した
Ｘ％、ｙ％に変換し、このＸ％１３’％をアドレスとす
るカラーマツプに１１」を書き込む。なお、所定の領域
（ｍ　ｓ　）が黒である場合、即ち画像信号Ｒ，Ｇ、Ｂ
全てが一定レベル以下の場合、全文読取フラグをセット
する。Next, the operation of the color characteristic storage section 3 will be explained. The color characteristic storage unit 3 stores chromaticity coordinates x obtained from RGB image signals.
, convert y into an integer, for example, multiply it by 100 and convert it into an integer, X%, y
It has a two-dimensional memory (color map) whose address is %, and when reading of a reading designation card as shown in Fig. 4 starts, the color map and full text reading flag are set to 10.
．． Initialize to . Next, scanning is performed on a predetermined area (m s )
When it reaches , the RGB image signal that can be inputted is converted into integers of X% and y%, and 11'' is written in the color map with this X%13'% as the address. Note that if the predetermined area (m s ) is black, that is, the image signals R, G, B
If all are below a certain level, set the full text reading flag.

次に、画像抽出部４のマーク識別処理、マーク領域内の
画像抽出処理について、第５図〜第６図を用いて説明す
る。Next, the mark identification process and the image extraction process in the mark area by the image extraction section 4 will be explained using FIGS. 5 and 6.

先ず、マーク識別処理は次のようにして行なう。画像入
力部１が第５図に示すような用紙に対して主走査を水平
方向（ＸＸ）、副走査を垂直方向（ＹＹ）として走査し
て出力されるＲＧＢの多値レベル信号を前述の式（１）
　、　（２）を用いて色度座標ｘ、ｙに変換し、該ｘ、
ｙを整数化したＸ％。First, mark identification processing is performed as follows. The image input unit 1 scans a sheet of paper as shown in FIG. 5 with main scanning in the horizontal direction (XX) and sub-scanning in the vertical direction (YY), and outputs an RGB multi-level signal using the above formula. (1)
, (2) to convert to chromaticity coordinates x, y, and the x,
X%, which is y converted to an integer.

ｙ％を用いて色特性記憶部３のカラーマツプを参照し、
該カラーマツプが「１」の場合、走査点がマーク上にあ
ると識別する。Refer to the color map in the color characteristic storage unit 3 using y%,
If the color map is "1", it is identified that the scanning point is on the mark.

マーク領域内の画像抽出処理について第６図を用いて説
明する。第６図において、副走査ＹＹｉ行ではマーク画
像は検出されないが、ＹＹ２行ではカラーマツプに記憶
きれた色特性ｊのマーク画像が前述のマーク識別処理に
よって識別され、更にＹＹａ行では前記マーク画像に連
結し、且つ挾まれた画像（ＸＸ３１　ｗ−ＸＸ３２　ｂ
　）が存在するため、当該挾まれた画像を読書対象の画
像として画像抽出部４から文字認識部５に出力する。前
述の手順でマーク画像に挾まれた画像を読書対象の画像
として出力しているが、副走査ＹＹｎ行に達すると連結
マーク画像が存在しなくなるので読書対象の画像の抽出
は終了する。The image extraction process within the mark area will be explained using FIG. 6. In FIG. 6, no mark image is detected in the sub-scanning row YYi, but in the YY2 row, the mark image with the color characteristic j that has been stored in the color map is identified by the mark identification process described above, and furthermore, in the YYa row, the mark image is connected to the mark image. and the sandwiched image (XX31 w-XX32 b
), the image extraction section 4 outputs the interposed image to the character recognition section 5 as an image to be read. In the above-described procedure, the image sandwiched between the mark images is output as an image to be read, but when the sub-scanning line YYn is reached, there is no longer a connected mark image, so the extraction of the image to be read is completed.

なお、マーク画像が連結しているか否かの判定は公知の
技術例えば特公昭６０−５５８６８号公報に詳述しであ
るように、下式（３）を用いて行なう。It should be noted that the determination as to whether or not the mark images are connected is made using the following equation (3) using a known technique, for example, as detailed in Japanese Patent Publication No. 60-55868.

ＸＸ（ｉ−１）ｋｂ≦ＸＸｉｋｗＸＸ（ｉ−１）ｋｗ≦ＸＸ１ｋｂ　　　　　（３）但し
、ＸＸ（ｉ−１）ｋｂ、ＸＸ（ｉ−１）ｋｗは各々走査
行ＹＹｉ行の前走査行ＹＹｉ−１の色特性「ＯＪから色
特性「ｊＪへの変化点及び色特性「ｊ」から色特性「０
」への変化点ペアのＸ座標を示す、ＸＸ１ｋｂ　、ＸＸ
ｉ　ｋｗは各々走査行Ｙｉの色特性ｒ□、から前記色特
性と同一の色特性１ｊ」への変化点及び前記色特性「ｊ
、から色特性「ＯＪへの変化点ベアのＸ座標を示す。但
し、色特性記憶部３に全文読取フラグがセットａれてい
る場合前述の処理は行なわず、画像入力部１から出力さ
れ全画像を読み上げ対象の画像として画像抽出部４から
出力する。XX(i-1)kb≦XXikw XX(i-1)kw≦XX1kb (3) However, XX(i-1)kb and XX(i-1)kw are respectively the previous scanning row YYi- of the scanning row YYi row. 1, the change point from color characteristic “OJ” to color characteristic “jJ” and color characteristic “j” to color characteristic “0”
XX1kb, XX indicating the X coordinate of the pair of change points to ``
i kw is the change point from the color characteristic r□ of the scanning row Yi to the same color characteristic 1j and the color characteristic ``j'' of the scanning row Yi, respectively.
, indicates the X coordinate of the change point bear from the color characteristic "OJ". However, if the full text reading flag is set a in the color characteristic storage section 3, the above processing is not performed and the entire image is output from the image input section 1. The image is output from the image extraction unit 4 as an image to be read out.

なお、読書対象の文書、書籍が無彩色の場合には、画像
抽出部４に無彩色画像抽出手段を設けておき、文字認識
部２３から出力される画像は無彩色画像のみとしておく
ことにより、マーク画像による影響を除去することが可
能である。Note that when the document or book to be read is achromatic, the image extraction section 4 is provided with an achromatic image extraction means, and the image output from the character recognition section 23 is only an achromatic image. It is possible to remove the influence of mark images.

前記無彩色画像抽出手段は、公知の方法、例えばＲ，Ｇ
、Ｈの信号が全て同一に近いレベルにあることによって
実現する。The achromatic image extraction means uses a known method, for example, R, G
, H signals are all at nearly the same level.

また、前記無彩色抽出手段には２値化手段を接統し、２
値化された画像を読書対象の画像とする。なお、２値化
のための閾値は公知の如何なる方法によってもよい。Further, the achromatic color extraction means is connected with a binarization means,
Let the digitized image be the image to be read. Note that the threshold value for binarization may be determined using any known method.

読書対象に第５図に示すように１要約」を指定し、第６
図に示す文書を入力した場合、マークｍ２に囲まれた画
像が読書対象の画像として出力きれ、当該画像から文字
の切出し、認識を行なった後音声合成され朗読音声が出
力される。読み上げ対象に１書誌的事項」や１章１節の
題名」が指定された場合も同様な処理が行なわれる。As shown in Figure 5, specify ``Summary 1'' as the reading target, and
When the document shown in the figure is input, the image surrounded by the mark m2 is output as the image to be read, and after characters are extracted and recognized from the image, voice synthesis is performed and a reading voice is output. A similar process is performed when "1 bibliographic item" or "title of 1 chapter, 1 section" is specified as the object to be read out.

なお、読書カード及び文書、書籍に追記又は印刷するマ
ークの位置、大きき７色、数、形状は前述のものに限定
されるものではない。Note that the position, size, seven colors, number, and shape of marks to be added or printed on reading cards, documents, and books are not limited to those described above.

前述の説明では書籍上のカラーマーク部分を選択するも
のとしたが、カラーマークの代りに文字の色を変えて印
刷した書籍を用意しておき、当該文字の色を選択するよ
う構成してもよい。読書領域毎に異なる色の文字が印刷
されている場合、画像抽出部４は、前述のマーク識別処
理、マーク領域内画像抽出処理が不要となり、設定され
た読取色特性の画像を直接読取対象の画像として抽出す
る。In the above explanation, it is assumed that the color mark part on the book is selected, but it is also possible to prepare a book printed with the text in a different color instead of the color mark, and configure it to select the color of the text. good. When characters of different colors are printed in each reading area, the image extraction unit 4 does not need the above-mentioned mark identification processing and mark area image extraction processing, and directly extracts the image with the set reading color characteristics from the reading target. Extract as an image.

また、読取モード切換部２に下記の機能を追加すること
によって、切換動作の自動化が可能となる。Furthermore, by adding the following functions to the reading mode switching unit 2, automation of the switching operation becomes possible.

（１）読書カードの大きさを文書書籍と比較して小びく
すると共に、用紙の大きさ比較ができる機能。(1) A function that allows you to compare the size of a reading card with a text book and compare the size of the paper.

（２〉読書カードの所定の位置に判別マークを付加する
と共に判別マークの有無を検出する機能。(2> A function that adds a discrimination mark to a predetermined position on a reading card and detects the presence or absence of a discrimination mark.

（３）読書カードの所定の位置に切り欠きを設け、その
切り欠きを検出する機能。(3) A function to provide a notch at a predetermined position on a reading card and detect the notch.

（４）読書カード専用の入力場所を設け（画像入力位置
は同一）その入力場所を通過するか否かを検出する機能
。(4) A function to provide an input location exclusively for reading cards (the image input location is the same) and detect whether the input location is passed through.

〔Effect of the invention〕

以上詳細に説明したように本発明によれは、健常者が文
書書籍の拾い読みを行なう手順で文書書籍の色特性に対
応するカラー画像又は文書書籍の所望部分に重要度に合
わせた色特性に対応するカラーマークを追記または印刷
しておき、文書書籍のカラーマーク部分の文書を自由に
選択して抽出し、読み上げるので、健常者が日常的に行
なっている文書書籍の拾い読みに極力近い形の読み方が
でき、盲人や肢体障害者等の障害者の読書環境の向上に
役立つと共に、健常者においても、視覚による読書環境
と略同じ読書環境を与えることができる読書処理装置を
提供できるという優れた効果が得られる。As explained in detail above, according to the present invention, a color image corresponding to the color characteristics of the document or a desired part of the document can be applied to a color characteristic according to the importance level in a procedure in which an able-bodied person browses through the document. The user can add or print out the color marks to be printed, and then freely select and extract the documents in the color marked portion of the book and read them aloud. The present invention has the excellent effect of being able to provide a reading processing device that is useful for improving the reading environment for people with disabilities such as blind people and people with physical disabilities, and can also provide a reading environment that is almost the same as a visual reading environment even for able-bodied people. is obtained.

[Brief explanation of drawings]

第１図は本発明の実施例の読書処理装置の構成を示すブ
ロック図、第２は従来の実施例の読書処の説明図、同図
（ｂ）はマークの濃度範囲の説明図、第４図は読書カー
ドの例を示す図、第５図は読書対象文書の例を示す図、
第６図は画像抽出の説明図である。図中、１・・・・画像入力部、２・・・・読取モード切
換部、３・・・・色特性記憶部、４・・・・画像抽出部
、５・・・・文字認識部、６・・・・音声合成部、７・・・・音声出力部。特許出願人　沖電気工業株式会社４へ゛１友弁理士　熊　谷　隆（外１名）マー７す」し
曳訃固め討明ａＩ！！糟社の説明ｌ第３図１ｅｉ＊−）−ｓｑ″ＩｉＬｙｒ−Ｋｍ第４図第５図一化イ亀ネｉｉ±の會）−日月６０第６図FIG. 1 is a block diagram showing the configuration of the reading processing device according to the embodiment of the present invention, FIG. 2 is an explanatory diagram of the reading area of the conventional embodiment, FIG. The figure shows an example of a reading card, and FIG. 5 shows an example of a document to be read.
FIG. 6 is an explanatory diagram of image extraction. In the figure, 1...image input unit, 2...reading mode switching unit, 3...color characteristic storage unit, 4...image extraction unit, 5...character recognition unit, 6...Speech synthesis unit, 7...Speech output unit. Patent applicant: Oki Electric Industry Co., Ltd. 4: Friendly patent attorney: Takashi Kumagai (1 other person) ! Explanation of Kasusha Figure 3 1ei*-)-sq''IiLyr-Km Figure 4 Figure 5 Meeting of Ika Ikamen ii±)-Sun/Monday 60 Figure 6

Claims

[Claims]

(1) In a reading processing device that includes a character recognition unit that recognizes characters on a medium and a voice synthesis unit that synthesizes voice based on the recognition result recognized by the character recognition unit, and outputs the recognition result as voice. , an image input section that scans the medium and outputs an image signal; a color characteristic storage section that measures and stores the color characteristics on the designated reading medium; A reading processing device comprising: an image extracting section that extracts a corresponding color character image as a reading target and outputs it to the character recognition section.

(2) The medium has a color mark added or printed thereon, and the image extraction unit detects the color mark based on the color characteristics and extracts the character image within the color mark area as a reading target. The reading processing device according to claim 1, characterized in that:

(3) The reading processing device according to claim 1 or 2, wherein the reading designation medium is a card on which Braille characters representing the type of reading object are written.