JP2000137765A5 - - Google Patents

Download PDF

Info

Publication number
JP2000137765A5
JP2000137765A5 JP1998311464A JP31146498A JP2000137765A5 JP 2000137765 A5 JP2000137765 A5 JP 2000137765A5 JP 1998311464 A JP1998311464 A JP 1998311464A JP 31146498 A JP31146498 A JP 31146498A JP 2000137765 A5 JP2000137765 A5 JP 2000137765A5
Authority
JP
Japan
Prior art keywords
character
image
character recognition
recognition result
threshold value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1998311464A
Other languages
Japanese (ja)
Other versions
JP2000137765A (en
Filing date
Publication date
Application filed filed Critical
Priority to JP10311464A priority Critical patent/JP2000137765A/en
Priority claimed from JP10311464A external-priority patent/JP2000137765A/en
Publication of JP2000137765A publication Critical patent/JP2000137765A/en
Publication of JP2000137765A5 publication Critical patent/JP2000137765A5/ja
Pending legal-status Critical Current

Links

Description

【特許請求の範囲】
【請求項1】
原稿画像中の文字候補画像部分について文字認識する文字認識手段と、
各文字候補画像部分について前記文字認識手段による文字認識結果の尤度が所定の閾値より高いか否か判断する判断手段と、
前記判断手段により文字認識結果の尤度が前記閾値より高いと判断された文字候補画像部分は、当該文字認識結果に該当する文字コードで出力する文字コード出力手段と
前記判断手段により文字認識結果の尤度が前記閾値より高くないと判断された文字候補画像部分は、画像データで出力する画像出力手段と
を備えることを特徴とする画像処理装置。
【請求項2】
更に、前記文字コード出力手段で出力された文字コードに基づく文字パターンを生成する文字パターン生成手段と、
前記文字パターン生成手段で生成された文字パターンと前記画像出力手段で出力された画像データとを合成して可視画像を形成する画像形成手段と
を備えることを特徴とする請求項第1項に記載の画像処理装置。
【請求項3】
前記判断手段では、前記文字候補画像部分についての特徴量と文字認識辞書に格納されている標準特徴量とに基づいて算出される距離値が、所定の閾値より小さい場合は前記文字認識結果の尤度が高いと判断し、前記所定の閾値以上である場合は前記文字認識結果の尤度が高くないと判断することを特徴とする請求項第1項に記載の画像処理装置。
【請求項4】
更に、前記判断手段により文字認識結果の尤度が高いと判断された複数の文字候補画像部分に基づいて、字体を判定する字体判定手段を備えることを特徴とする請求項第1項に記載の画像処理装置。
【請求項5】
更に、前記判断手段により文字認識結果の尤度が高いと判断された複数の文字候補画像部分に基づいて、字体を判定する字体判定手段を備え、
前記文字パターン発生手段は当該判定された字体を用いて前記文字コードに基づく文字パターンを生成することを特徴とする請求項第2項に記載の画像処理装置。
【請求項6】
ユーザの指示に基づいて字体複写モードの設定を行う字体複写設定手段を更に有し、
前記字体複写設定手段により前記字体複写モードに設定されている場合に、前記字体判定手段で字体を判定し、且つ前記文字パターン発生手段で当該判定された字体を用いて前記文字コードに基づく文字パターンを生成することを特徴とする請求項第5項に記載の画像処理装置。
【請求項7】
ユーザの指示に基づいて字体変換モードの設定を行う字体変換設定手段を更に有し、
前記字体変換設定手段により前記字体変換モードに設定されている場合、前記文字パターン発生手段は、予め設定された字体を用いて前記文字コードに基づく文字パターンを生成することを特徴とする請求項第2項に記載の画像処理装置。
【請求項8】
更に、前記所定の閾値の値をマニュアル操作によって設定する手段を有することを特徴とする請求項第3項に記載の画像処理装置。
【請求項9】
更に、前記原稿画像の文字の占める割合を検出する検出手段と、
該検出手段で検出された割合に応じて前記所定の閾値を自動調整する調整手段と
を備えることを特徴とする請求項第3項に記載の画像処理装置。
【請求項10】
前記検出手段は、水平或いは/及び、垂直方向のドットのヒストグラムを生成するヒストグラム作成手段を含み、
前記調整手段は、前記ヒストグラム作成手段で作成したヒストグラムにおける最少頻度値に基づいて前記所定の閾値を調整することを特徴とする請求項第9項に記載の画像処理装置。
【請求項11】
前記検出手段は、原稿画像中の濃度分布を検出する手段を含み、
前記調整手段は、前記濃度分布に応じて前記所定の閾値を調整することを特徴とする請求項第9項に記載の画像処理装置。
【請求項12】
ユーザの指示に基づいて認識複写モードの設定を行う認識複写モード設定手段を更に有し、
前記認識複写モード設定手段により前記認識複写モードに設定されている場合は、前記文字認識手段と前記判断手段と前記文字コード出力手段と前記画像出力手段と前記文字パターン生成手段と前記画像形成手段とによる処理を実行することによって、前記原稿画像の文字候補画像部分の少なくとも一部が前記生成された文字パターンで合成された前記可視画像を出力し、
前記認識複写モード設定手段により前記認識複写モードに設定されていない場合は、前記原稿画像に対して通常の複写処理が行われることにより形成された複写画像を出力することを特徴とする請求項第2項に記載の画像処理装置。
【請求項13】
原稿画像中の文字候補画像部分について文字認識する文字認識工程と、
各文字候補画像部分について前記文字認識工程による文字認識結果の尤度が所定の閾値より高いか否か判断する判断工程と、
前記判断工程で文字認識結果の尤度が前記閾値より高いと判断された文字候補画像部分は、当該文字認識結果に該当する文字コードで出力する文字コード出力工程と
前記判断工程で文字認識結果の尤度が前記閾値より高くないと判断された文字候補画像部分は、画像データで出力する画像出力工程と
の各工程を、画像処理装置が自動的に実行するように制御することを特徴とする画像処理方法。
【請求項14】
コンピュータが読込み実行することで、原稿画像中の文字候補画像部分について文字認識する画像処理装置として機能させるためのプログラムコードを格納した記憶媒体であって、
各文字候補画像部分について前記文字認識による文字認識結果の尤度が所定の閾値より高いか否か判断する判断手段と、
前記判断手段により文字認識結果の尤度が前記閾値より高いと判断された文字候補画像部分は、当該文字認識結果に該当する文字コードで出力する文字コード出力手段と
前記判断手段により文字認識結果の尤度が前記閾値より高くないと判断された文字候補画像部分は、画像データで出力する画像出力手段と
の各手段を有する画像処理装置として、前記コンピュータを機能させるためのプログラムコードを格納した記憶媒体。
[Claims]
[Claim 1]
Character recognition means for character recognition of character candidate image parts in manuscript images,
A means for determining whether or not the likelihood of the character recognition result by the character recognition means is higher than a predetermined threshold value for each character candidate image portion, and
The character candidate image portion for which the likelihood of the character recognition result is determined to be higher than the threshold value by the determination means is the character recognition result of the character code output means that outputs the character code corresponding to the character recognition result and the determination means. An image processing apparatus characterized in that a character candidate image portion determined to have a likelihood not higher than the threshold value includes an image output means for outputting image data.
2.
Further, a character pattern generation means for generating a character pattern based on the character code output by the character code output means, and a character pattern generation means.
The first aspect of the claim is characterized by comprising an image forming means for forming a visible image by synthesizing a character pattern generated by the character pattern generating means and image data output by the image output means. Image processing equipment.
3.
In the determination means, if the distance value calculated based on the feature amount of the character candidate image portion and the standard feature amount stored in the character recognition dictionary is smaller than a predetermined threshold value, the character recognition result is likely. The image processing apparatus according to claim 1, wherein it is determined that the degree is high, and if it is equal to or higher than the predetermined threshold value, the likelihood of the character recognition result is not high.
4.
The first aspect of the claim is characterized by comprising a font determination means for determining a font based on a plurality of character candidate image portions determined by the determination means to have a high likelihood of a character recognition result. Image processing device.
5.
Further, a font determination means for determining a font based on a plurality of character candidate image portions determined by the determination means to have a high likelihood of a character recognition result is provided.
The image processing apparatus according to claim 2, wherein the character pattern generating means generates a character pattern based on the character code using the determined character font.
6.
It also has a font copy setting means for setting the font copy mode based on the user's instruction.
When the font copy setting means is set to the font copy mode, the font is determined by the font determination means, and the character pattern based on the character code is used by the character pattern generating means. The image processing apparatus according to claim 5, wherein the image processing apparatus is generated.
7.
It also has a font conversion setting means for setting the font conversion mode based on the user's instruction.
The first aspect of the present invention is that when the font conversion setting means is set to the font conversion mode, the character pattern generating means generates a character pattern based on the character code using a preset character font. The image processing apparatus according to item 2.
8.
The image processing apparatus according to claim 3, further comprising means for manually setting the value of the predetermined threshold value.
9.
Further, a detection means for detecting the proportion of characters in the original image and
The image processing apparatus according to claim 3, further comprising an adjusting means for automatically adjusting the predetermined threshold value according to the ratio detected by the detecting means.
10.
The detection means includes a histogram-creating means that produces a histogram of dots in the horizontal and / and vertical directions.
The image processing apparatus according to claim 9, wherein the adjusting means adjusts the predetermined threshold value based on the minimum frequency value in the histogram created by the histogram creating means.
11.
The detection means includes means for detecting a density distribution in a manuscript image.
The image processing apparatus according to claim 9, wherein the adjusting means adjusts the predetermined threshold value according to the concentration distribution.
12.
It also has a recognition copy mode setting means for setting the recognition copy mode based on the user's instruction.
When the recognition copy mode is set by the recognition copy mode setting means, the character recognition means, the determination means, the character code output means, the image output means, the character pattern generation means, and the image forming means. By executing the process according to the above, the visible image in which at least a part of the character candidate image portion of the original image is synthesized by the generated character pattern is output.
The first aspect of the present invention is that when the recognition copy mode is not set by the recognition copy mode setting means, a copy image formed by performing a normal copy process on the original image is output. The image processing apparatus according to item 2.
13.
A character recognition process that recognizes characters in the character candidate image part of the manuscript image,
For each character candidate image portion, a determination step of determining whether or not the likelihood of the character recognition result by the character recognition process is higher than a predetermined threshold value, and
The character candidate image portion for which the likelihood of the character recognition result is determined to be higher than the threshold value in the determination step is the character code output step of outputting the character code corresponding to the character recognition result and the character recognition result in the determination step. The character candidate image portion determined to have a likelihood not higher than the threshold value is characterized in that the image processing apparatus controls each step of the image output step of outputting the image data so as to be automatically executed. Image processing method.
14.
It is a storage medium that stores a program code for functioning as an image processing device that recognizes characters in a character candidate image part in a manuscript image by being read and executed by a computer.
A means for determining whether or not the likelihood of the character recognition result by the character recognition for each character candidate image portion is higher than a predetermined threshold value, and
The character candidate image portion for which the likelihood of the character recognition result is determined to be higher than the threshold value by the determination means is the character recognition result of the character code output means that outputs the character code corresponding to the character recognition result and the determination means. The character candidate image portion determined to have a likelihood not higher than the threshold is stored as an image processing device having each means with an image output means for outputting image data, and stores a program code for operating the computer. Medium.

この課題を解決するため、たとえば本発明の画像処理装置は以下の構成を備える。すなわち、
原稿画像中の文字候補画像部分について文字認識する文字認識手段と、
各文字候補画像部分について前記文字認識手段による文字認識結果の尤度が所定の閾値より高いか否か判断する判断手段と、
前記判断手段により文字認識結果の尤度が前記閾値より高いと判断された文字候補画像部分は、当該文字認識結果に該当する文字コードで出力する文字コード出力手段と
前記判断手段により文字認識結果の尤度が前記閾値より高くないと判断された文字候補画像部分は、画像データで出力する画像出力手段とを備える。
In order to solve this problem, for example, the image processing apparatus of the present invention has the following configuration. That is,
Character recognition means for character recognition of character candidate image parts in manuscript images,
A means for determining whether or not the likelihood of the character recognition result by the character recognition means is higher than a predetermined threshold value for each character candidate image portion, and
The character candidate image portion for which the likelihood of the character recognition result is determined to be higher than the threshold value by the determination means is the character recognition result of the character code output means that outputs the character code corresponding to the character recognition result and the determination means. The character candidate image portion determined to have a likelihood not higher than the threshold value includes an image output means for outputting image data.

JP10311464A 1998-10-30 1998-10-30 Processor and method for image processing and storage medium Pending JP2000137765A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP10311464A JP2000137765A (en) 1998-10-30 1998-10-30 Processor and method for image processing and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP10311464A JP2000137765A (en) 1998-10-30 1998-10-30 Processor and method for image processing and storage medium

Publications (2)

Publication Number Publication Date
JP2000137765A JP2000137765A (en) 2000-05-16
JP2000137765A5 true JP2000137765A5 (en) 2005-12-15

Family

ID=18017550

Family Applications (1)

Application Number Title Priority Date Filing Date
JP10311464A Pending JP2000137765A (en) 1998-10-30 1998-10-30 Processor and method for image processing and storage medium

Country Status (1)

Country Link
JP (1) JP2000137765A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103257954B (en) * 2013-06-05 2016-08-10 北京百度网讯科技有限公司 The proofreading method of word, system and check and correction server in ancient books
JP6269256B2 (en) * 2014-03-31 2018-01-31 京セラドキュメントソリューションズ株式会社 Information processing apparatus, image forming apparatus, information processing method, and information processing program

Similar Documents

Publication Publication Date Title
US6208744B1 (en) Document image processor and method for setting a document format conforming to a document image
US6798906B1 (en) Image processing apparatus and method including line segment data extraction
JP2003132358A (en) Image processing method, device and system
US5075895A (en) Method and apparatus for recognizing table area formed in binary image of document
US20120243785A1 (en) Method of detection document alteration by comparing characters using shape features of characters
JP2008227930A (en) Image forming apparatus and method
JP2010028309A (en) Apparatus, method, program, and storage medium
JP4232679B2 (en) Image forming apparatus and program
US20100328728A1 (en) Image forming apparatus, image forming method and storage medium
JP2002199206A (en) Method and device for imbedding and extracting data for document, and medium
JP2002015280A (en) Device and method for image recognition, and computer- readable recording medium with recorded image recognizing program
JP2010056691A (en) Device and method for processing image
JPH05276382A (en) Method for processing picture and device therefor
JP2000137765A5 (en)
JP2007174615A (en) Image processor, image processing method, program, storage medium
JP3019086B1 (en) Image processing apparatus, image processing method, and image forming apparatus
US20080304700A1 (en) Image forming apparatus and method of image forming
JP4281236B2 (en) Image recognition apparatus, image recognition method, and computer-readable recording medium storing image recognition program
JP2001222683A (en) Method and device for processing picture, device and method for recognizing character and storage medium
JP4998176B2 (en) Translation apparatus and program
JPH0916582A (en) Document preparing device and method for outputting recognition result used for this device
JP3717357B2 (en) Binary threshold calculation method and apparatus
JP2007011939A (en) Image decision device and method therefor
US20100104131A1 (en) Document processing apparatus and document processing method
JP4821616B2 (en) Information embedding device, embedded information acquisition device