JP2000020677A

JP2000020677A - Image capture/massage display device

Info

Publication number: JP2000020677A
Application number: JP10163268A
Authority: JP
Inventors: C S Chan Jackson; ジャクソン・シイ・エス・チャン; D S Ho David; デイヴィッド・ディ・エス・ホ; H Wai Chen Fred; フレッド・エイチ・ワイ・チェン; X S Wan Samson; サムソン・エックス・エス・ワン; Lee Rin; リン・リー
Original assignee: Inventec Corp
Current assignee: Inventec Corp
Priority date: 1998-06-11
Filing date: 1998-06-11
Publication date: 2000-01-21

Abstract

PROBLEM TO BE SOLVED: To obtain a device capable of capturing images of an object with a camera and displaying information related to the object on the device by identifying a selected image as a specific object having a code to be recognized by a computer system by processing the selected image, and extracting and displaying the information related to the object. SOLUTION: This device 10 captures optical images of the object existing in front of a lens 18, arranged directly above a lateral image width guide 24 and generates an optical image signal of the object. After receiving an instruction from a 1st switch 34 for taking a snapshot image of the object by the lens 18 and other constitutional parts in the device 10, the object is identified and recognized, information related to the object is checked, and the image of the object and information related to the object are displayed on a display. The object can be expressed as words, numbers, an image or a graph which are two-dimensional type printed data, or as a three-dimensional object such as a clock, a dog and an automobile.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は一般に画像処理シス
テムに関し、さらに詳細には、対象の画像を取り込み、
取り込んだ画像を認識した対象として認識し、認識した
対象に関係する情報を取り出し、取り出した情報をディ
スプレイ上に表示するカメラを有するシステムに関す
る。FIELD OF THE INVENTION The present invention relates generally to image processing systems, and more particularly, to capturing an image of a subject,
The present invention relates to a system having a camera that recognizes a captured image as a recognized target, extracts information related to the recognized target, and displays the extracted information on a display.

【０００２】[0002]

【従来の技術】現代社会は、個人が吸収すべき情報およ
び知識が多い環境であり、このような情報は、多くの形
式で利用することができる。例えば、それはテレビ番
組、ラジオ放送、インターネットなどから来る可能性が
ある。それにも関わらず、最も幅広く使用される情報ソ
ースは依然として印刷媒体、すなわち新聞、雑誌、また
は書籍である。新聞または書籍は、それらが印刷された
形態で情報を提供し、将来使用する際に信頼して参照す
ることができる点でその他の媒体と異なる。インターネ
ットから得た情報でさえ、将来参照するためにしばしば
ハードコピーに印刷される。2. Description of the Related Art Modern society is an environment in which individuals have a lot of information and knowledge to be absorbed, and such information is available in many forms. For example, it can come from television programs, radio broadcasts, the Internet, etc. Nevertheless, the most widely used information sources are still print media, ie newspapers, magazines, or books. Newspapers or books differ from other media in that they provide information in printed form and can be reliably referenced for future use. Even information obtained from the Internet is often printed on hardcopy for future reference.

【０００３】学生は、通常は、教師から与えられた教科
書その他の印刷資料を読むことによって知識を得る。専
門家は自身の専門分野の雑誌またはニュースレターを読
んで時代遅れにならないようにしている。技術者および
科学者はデータブック、技術機関誌、またはマーケティ
ングや取引の雑誌を読んで、それぞれの専門分野での最
新の事情に通じているようにしている。セールスマンお
よび市場の人々は市場雑誌を読んで、彼らが売る製品の
傾向を知るようにしている。印刷形式で利用できる大量
の情報は、基本的に、個人がこうした資料を読むことに
彼または彼女の日常生活のかなりの部分を当てることを
要求する。文書を読む時には、人は時々辞書で語句を調
べる必要があることもある。多くの知らない単語を辞書
で調べる必要が生じると、人は挫折する可能性がある。
このことは、人が読むことを妨げ、それにより彼または
彼女の学習能力を妨げる可能性がある。[0003] Students usually gain knowledge by reading textbooks and other printed materials provided by the teacher. Experts read magazines or newsletters in their areas of expertise to keep them out of date. Engineers and scientists read data books, technical journals, or marketing and trading magazines to keep up to date with their specialties. Salesmen and market people read market magazines to learn about the trends in the products they sell. The large amount of information available in printed form basically requires an individual to spend a significant portion of his or her daily life reading these materials. When reading a document, people sometimes need to look up words in a dictionary. If you need to look up many unknown words in a dictionary, you can be frustrated.
This can prevent a person from reading, thereby hindering his or her learning ability.

【０００４】電子辞書は、辞書の機能を提供し、従来の
辞書のページをめくることなく人が単語の意味を調べる
ことができるようにする装置である。さらに、最近の技
術開発により、単語の発音を言語化することができる音
声機能を有する電子辞書の製造が可能になった。しか
し、従来技術の電子辞書の１つの大きな欠点は、それら
ではユーザが所望の単語をキー入力する必要がある点で
ある。さらに最近になって、手書き認識技術により、特
殊な書込みパッド上での手書きによる単語の入力が可能
になった。このキー入力または手書き操作は、速度が遅
くなりがちでエラーが生じやすく、これにより文書を理
解する速度がさらに遅くなる。活字上の誤りにより電子
辞書が間違った単語を突き止めた場合には、その結果ユ
ーザが文章の意味を誤解する可能性がある。ユーザが、
彼または彼女が活字上の誤りを犯したことに気づいてい
ない場合には特にそのようになる。したがって、電子辞
書が、従来のタイプ入力または書込み入力方法以外の入
力を受け入れることが望ましい。[0004] An electronic dictionary is a device that provides the function of a dictionary and allows a person to look up the meaning of a word without turning over the pages of a conventional dictionary. Furthermore, recent technological developments have made it possible to produce electronic dictionaries with voice functions that can verbalize the pronunciation of words. However, one major disadvantage of prior art electronic dictionaries is that they require the user to key in the desired word. More recently, handwriting recognition technology has made it possible to input words by handwriting on special writing pads. This key entry or handwriting operation tends to be slow and error prone, thereby further slowing down the understanding of the document. If the electronic dictionary finds the wrong word due to a typographical error, the user may misunderstand the meaning of the sentence as a result. The user
This is especially true if he or she is unaware that he has made a typographical error. Therefore, it is desirable for the electronic dictionary to accept inputs other than conventional typed or written input methods.

【０００５】電子辞書は、電子オーガナイザ程度のサイ
ズで携帯可能になったが、持ち歩くには依然としてかさ
ばりすぎる。したがって、電子辞書は、ユーザが持ち運
ぶのにそれほど負担にならずに容易に利用できるよう
に、ポケットに入れることができることが望ましい。Although electronic dictionaries have become portable in the size of an electronic organizer, they are still too bulky to carry around. Therefore, it is desirable that the electronic dictionary can be put in a pocket so that the user can easily use the electronic dictionary without burdening the user.

【０００６】従来技術のほとんどのスキャナは、文書の
ページ全体または行を走査するためのものである。これ
は電子辞書での使用には適さない。電子辞書は通常は単
一の単語の定義を提供するのみであるので、文書のペー
ジ全体、または１行の走査でも負担になり、不経済にな
ることがある。したがって、一度に１つの単語またはい
くつかの単語の画像を取り込むことができる、携帯可能
な軽量の装置を有することが望ましい。[0006] Most prior art scanners are for scanning entire pages or lines of a document. This is not suitable for use in electronic dictionaries. Because electronic dictionaries typically only provide the definition of a single word, scanning an entire page of a document, or even a single line, can be burdensome and uneconomical. Therefore, it is desirable to have a portable, lightweight device that can capture images of one word or several words at a time.

【０００７】[0007]

【発明が解決しようとする課題】したがって、本発明の
目的は、カメラで対象の画像を取り込み、その対象に関
係する情報をその装置上に表示することができる装置を
提供することである。SUMMARY OF THE INVENTION It is, therefore, an object of the present invention to provide an apparatus capable of capturing an image of an object with a camera and displaying information relating to the object on the apparatus.

【０００８】本発明の別の目的は、単語の画像を取り込
んでユーザが単語を手動でキー入力する必要をなくし、
その単語に関係する情報を装置に取り付けられたディス
プレイ上に表示することができる電子辞書を提供するこ
とである。Another object of the present invention is to capture an image of a word, eliminating the need for the user to manually key in the word,
An electronic dictionary capable of displaying information related to the word on a display attached to the device.

【０００９】本発明の別の目的は、ユーザが快適に持ち
運ぶことができる携帯用電子辞書を提供することであ
る。Another object of the present invention is to provide a portable electronic dictionary that can be carried comfortably by a user.

【００１０】[0010]

【課題を解決するための手段】本発明の特定の実施態様
では、１つまたは複数の選択した対象の１つまたは複数
の画像を取り込んで表示し、選択した画像を処理してコ
ンピュータ・システムが認識できるコードを有する特定
の対象としてその選択した画像を識別し、それに関係す
る情報を取り出して表示する装置が開示されている。１
実施態様では、この装置は、前端および後端を有するボ
ディと、レンズ取付けリムによって前端に結合された、
レンズの前にある対象の光学画像を取り込むレンズと、
ボディに取り付けられたディスプレイと、ボディ上に位
置決めされたスイッチと、距離ガイドとを含む。間隔ガ
イドは、スタンドオフ・アームおよび横向き画像幅ガイ
ドをさらに含む。スタンドオフ・アームはボディの前部
から前方に延び、２つの端部を有し、その一端はボディ
に結合され、もう一端は横向き画像幅ガイドに結合され
る。画像取込み装置は、ユーザが選択した対象の画像を
表す信号を絶えず生成し、受信信号が表す画像を実時間
で表示し、スイッチが活動化するとそれにより選択した
対象の静止画像を撮り、次いでこの静止画像を特定の対
象として識別し、それに関係する情報を取り出す。SUMMARY OF THE INVENTION In certain embodiments of the present invention, a computer system captures and displays one or more images of one or more selected objects, processes the selected images, and provides a computer system. A device is disclosed that identifies the selected image as a particular object having a recognizable code, retrieves and displays information associated with the selected image. 1
In an embodiment, the device is coupled to a body having a front end and a rear end, and a front end by a lens mounting rim.
A lens for capturing an optical image of the object in front of the lens,
Includes a display mounted on the body, a switch positioned on the body, and a distance guide. The spacing guide further includes a standoff arm and a sideways image width guide. The standoff arm extends forward from the front of the body and has two ends, one end of which is connected to the body and the other end of which is connected to the sideways image width guide. The image capture device constantly generates a signal representing the image of the object selected by the user, displays the image represented by the received signal in real time, takes a still image of the selected object when the switch is activated, and then takes this The still image is identified as a specific object, and information related thereto is extracted.

【００１１】本発明の１つの利点は、カメラから対象の
画像を取り込み、その対象に関係する情報をその装置上
に表示することができる装置が提供されることである。One advantage of the present invention is that an apparatus is provided that can capture an image of an object from a camera and display information related to the object on the apparatus.

【００１２】本発明の別の利点は、キーボードまたは書
込みパッドから単語を入力しないで単語を取り込むこと
ができ、その単語に関係する情報を装置に取り付けられ
たディスプレイ上に表示することができる電子辞書が提
供されることである。Another advantage of the present invention is that an electronic dictionary capable of capturing words without entering the words from a keyboard or writing pad and displaying information related to the words on a display attached to the device. Is to be provided.

【００１３】本発明の別の利点は、ユーザが快適に持ち
運ぶことができる携帯用電子辞書が提供されることであ
る。[0013] Another advantage of the present invention is that a portable electronic dictionary is provided that can be carried comfortably by a user.

【００１４】本発明のその他の目的、特徴、および利点
は、いくつかの図面を参照する以下の好ましい実施形態
の詳細な説明を読めば当業者には明らかになるであろ
う。[0014] Other objects, features and advantages of the present invention will become apparent to those skilled in the art from a reading of the following detailed description of a preferred embodiment with reference to the several drawings.

【００１５】[0015]

【発明の実施の形態】本発明は多くの形態で実施するこ
とができるが、好ましい実施形態の細部を図１から９に
概略的に示す。本開示は、本発明を例示した実施形態に
限定するためのものではないことを理解されたい。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS While the invention may be embodied in many forms, details of a preferred embodiment are schematically illustrated in FIGS. It is to be understood that this disclosure is not intended to limit the invention to the illustrated embodiments.

【００１６】本発明は、画像取込み／メッセージ表示装
置を扱うものである。この装置は対象の光学画像を取り
込む。対象は、２次元形式の印刷資料の単語、数字、
図、グラフにすることも、あるいは時計、犬、自動車な
どの３次元物体などにすることもできる。電子辞書で
は、対象はいくつかの文字からなる単語となる。この装
置は単語を文字のサブパートに分割し、各文字の光学画
像信号をデジタル信号に変換する。次いでこの装置は、
単語について認識プロセスを実行し、単語に関係する情
報をデータベース中で探索し、発見した情報を取り出
し、その情報を装置のビデオ表示画面上に表示する。The present invention deals with an image capture / message display device. This device captures an optical image of an object. The target is words, numbers,
It can be a diagram, a graph, or a three-dimensional object such as a watch, a dog, a car, or the like. In an electronic dictionary, the target is a word consisting of several characters. The device divides words into character subparts and converts the optical image signal of each character into a digital signal. This device then
A recognition process is performed on the word, information related to the word is searched in a database, the found information is retrieved, and the information is displayed on a video display screen of the device.

【００１７】図１を参照すると、画像取込み／メッセー
ジ表示装置の現在好ましい実施形態の斜視図が示されて
いる。画像取込み／メッセージ表示装置１０はボディ１
２を有する。この実施形態では、ボディ１２は、ユーザ
がちょうどペンを持つように彼または彼女の手で快適に
持つことができるペンに似ている。装置１０は、前端１
４および後端１６を有する。レンズ１８はボディ１２の
前端１４に位置する。レンズ１８の奥のボディ１２の内
側には、対象の光学画像を取り込むためのＣＣＤベース
・カメラ（図示せず）が配置される。レンズ１８はその
周囲にレンズ取付けリム１９を有する。リム１９はボデ
ィ１２に、スナップ留め、溶接、ねじ留め、またはその
他任意の適当な方法で固定することができる。スタンド
オフ・アーム２０はボディ１２の前端１４から前方に延
びる。スタンドオフ・アーム２０は２つの端部を有し、
アーム２０の一端２２はボディ１２に接続される。アー
ム２０の末端２６に接続された横向き画像幅ガイド２４
は適当な手段を介してアーム２０に固定される。横向き
画像幅ガイド２４は、所定角度でアーム２０に固定され
る。横向き画像幅ガイド２４がスタンドオフ・アーム２
０に固定される角度は、この装置の動作には影響を及ぼ
さない。ガイド２４は、アーム２０に直角に固定するこ
ともできる。別法として、横向き画像幅ガイド２４は、
ボディ１２に対してどのような角度にもなる融通性を有
するように、玉継手でスタンドオフ・アーム２０に結合
することもできる。玉継手を使用してガイド２４をアー
ム２０に接続することの１つの利点は、ユーザが、どの
ように傾いた姿勢でも楽にそれを使用することができる
ことである。もう１つの利点は、それにより間隔ガイド
機構の耐久性が向上し、アーム２０が破損しにくくなる
ことである。スタンドオフ・アーム２０と横向き画像幅
ガイド２４は共に間隔ガイド２５を形成する。アーム２
０の長さはレンズ１８の焦点距離によって決定され、そ
の結果レンズ１８は、常にレンズの前のアーム２０の長
さと等しい距離にある対象上で集束することになる。ボ
ディ１２の後端１６では、キャップ４２がボディ１２に
固定される。レンズ取付けリム１９と同様に、キャップ
４２はボディ１２にスナップ留めするか、またはねじ留
めすることができる。バッテリ（図示せず）は、後端１
６からボディ１２の内側に設置され、装置１０の動作用
の電力を供給する。Referring to FIG. 1, there is shown a perspective view of a presently preferred embodiment of the image capture / message display. The image capture / message display device 10 is the body 1
2 In this embodiment, the body 12 resembles a pen that the user can comfortably hold in his or her hand just like a pen. The device 10 has a front end 1
4 and a rear end 16. The lens 18 is located at the front end 14 of the body 12. A CCD-based camera (not shown) for capturing an optical image of an object is arranged inside the body 12 behind the lens 18. The lens 18 has a lens mounting rim 19 around it. Rim 19 may be secured to body 12 by snapping, welding, screwing, or any other suitable method. Standoff arm 20 extends forward from front end 14 of body 12. The standoff arm 20 has two ends,
One end 22 of the arm 20 is connected to the body 12. Sideways image width guide 24 connected to distal end 26 of arm 20
Is fixed to the arm 20 via suitable means. The horizontal image width guide 24 is fixed to the arm 20 at a predetermined angle. The horizontal image width guide 24 is the stand-off arm 2
The angle fixed at 0 has no effect on the operation of this device. The guide 24 may be fixed to the arm 20 at right angles. Alternatively, the horizontal image width guide 24
The ball joint may be coupled to the stand-off arm 20 so that it has any angle of flexibility with respect to the body 12. One advantage of using a ball joint to connect the guide 24 to the arm 20 is that the user can easily use it in any inclined position. Another advantage is that it increases the durability of the spacing guide mechanism and makes the arm 20 less susceptible to breakage. Standoff arm 20 and landscape image width guide 24 together form spacing guide 25. Arm 2
The length of zero is determined by the focal length of the lens 18 so that the lens 18 will always focus on an object at a distance equal to the length of the arm 20 in front of the lens. At the rear end 16 of the body 12, a cap 42 is fixed to the body 12. Like the lens mounting rim 19, the cap 42 can be snapped or screwed to the body 12. The battery (not shown) is connected to the rear end 1
6 is installed inside the body 12 to supply power for operating the device 10.

【００１８】フラットパネル・ディスプレイであるディ
スプレイ２８は、装置１０のボディ１２上に位置決めさ
れる。現在好ましい実施形態では、ディスプレイは液晶
ディスプレイである。複数のボタン３０、３２が、ディ
スプレイ２８の輝度を調節するためにディスプレイ２８
付近に配置される。第１ボタン３０を押すとディスプレ
イの輝度が増す。第２ボタン３２を押すとディスプレイ
の輝度が低下する。ボディ上には、複数のスイッチ３
４、３６、３８、４０もある。第１スイッチ３４は装置
１０中のレンズその他の構成部品によって対象のスナッ
プショット画像を撮るためのものである。その他の構成
部品およびそれらの操作について以下の説明で論じる。
その他３つのスイッチ３６、３８、４０は、レンズの焦
点の調節、対象を見るズーミングの制御、対象のサンプ
ル音声発音の生成、音声認識動作の活動化、通信動作の
活動化、または単なるオン／オフ・スイッチとしての機
能など、その他の適用業務のために使用することができ
る。これら３つのスイッチの別の代替として、これらを
ｓｈｉｆｔキー、ａｌｔキー、ｃｏｎｔｒｏｌキーとし
て使用し、スイッチの適用業務の組合せをより多くする
こともできる。こうした追加の機能は、ディスプレイの
輝度を制御するボタン３０、３２にも適用することがで
きる。A display 28, which is a flat panel display, is positioned on the body 12 of the device 10. In a currently preferred embodiment, the display is a liquid crystal display. A plurality of buttons 30, 32 are provided on the display 28 to adjust the brightness of the display 28.
It is placed near. Pressing the first button 30 increases the brightness of the display. Pressing the second button 32 reduces the brightness of the display. Multiple switches 3 on the body
There are also 4, 36, 38 and 40. The first switch 34 is for taking a snapshot image of an object by a lens or other components in the device 10. Other components and their operation are discussed in the following description.
The other three switches 36, 38, 40 control the focus of the lens, control the zooming of the subject, generate a sample speech pronunciation of the subject, activate a speech recognition operation, activate a communication operation, or simply on / off. Can be used for other applications, such as functioning as a switch. As another alternative to these three switches, they can be used as a shift key, an alt key, and a control key to provide more combinations of switch applications. These additional functions can also be applied to buttons 30, 32 that control the brightness of the display.

【００１９】装置１０は、横向き画像幅ガイド２４の真
上にあるレンズ１８の前の対象の光学画像を取り込み、
対象の光学画像信号を生成し、第１スイッチ３４からの
指示を受信した後で対象を識別および認識し、対象に関
係する情報を調べ、対象の画像およびそれが関係する情
報をディスプレイ上に表示することができる。対象は、
単語、数字、またはその他の形態の対象になる可能性が
ある。The apparatus 10 captures an optical image of the object in front of the lens 18 just above the horizontal image width guide 24,
Generating an optical image signal of the object, identifying and recognizing the object after receiving an instruction from the first switch 34, examining information related to the object, displaying the image of the object and the information related thereto on the display can do. The target is
Can be subject to words, numbers, or other forms.

【００２０】装置１０はペンと同様に携帯可能である。
電子辞書の例では、画像取込み／メッセージ表示装置１
０は、単語の光学画像を取り込み、その単語に関係する
定義その他の情報を発見し、その単語とその定義および
情報とをディスプレイ上に表示する。個人情報管理（Ｐ
ＩＭ）システムでは、画像取込み／メッセージ表示装置
１０は、個人名または団体名を取り込み、接触した個人
の電話および／またはファクシミリ番号、住所、会合の
予定および／または日程などを発見する。暦上の行事ま
たは特定の日付の予約も、カレンダーの日付を取り込
み、その日付を認識し、その日の予定を取り出し、その
予定をディスプレイ上に表示することによって取り出す
ことができる。同様に、名刺についての情報を画像取込
み装置で取り込み、データベースに記憶することもでき
る。名刺の基本要素を識別し、画像取込み装置を介して
その要素の静止画像を取り込むことによって、データベ
ースに記憶された情報を取り出すことができる。The device 10 is as portable as a pen.
In the example of the electronic dictionary, the image capture / message display device 1
0 captures the optical image of the word, finds definitions and other information related to the word, and displays the word and its definitions and information on the display. Personal information management (P
In an IM) system, the image capture / message display 10 captures an individual or organization name and finds the telephone and / or facsimile number, address, meeting schedule and / or schedule of the contacting individual. A calendar event or appointment for a particular date can also be retrieved by capturing the calendar date, recognizing that date, retrieving the appointment for the day, and displaying the appointment on the display. Similarly, information about a business card can be captured by an image capturing device and stored in a database. By identifying the basic elements of a business card and capturing a still image of the element via an image capture device, the information stored in the database can be retrieved.

【００２１】レンズ１８は円形レンズにすることができ
る。しかし、電子辞書では、その画像をこの装置が取り
込むことになる対象は、通常は長方形の形状をしてい
る。これは、英語その他の西洋の言語では、各単語がい
くつかのアルファベットから生成されるためである。ア
ルファベットをまとめると、それらは通常は左から右に
配列され、長方形の形状をした単語となる。したがっ
て、長方形形状の対象の光学画像を取り込むために、レ
ンズ１８の形状を円形にする必要はない。本発明の画像
取込みプロセスはほぼ長方形の形状をした対象に関係す
るので、これを円形レンズの一部分にすることができ
る。別法として、レンズをフレネル・レンズまたはフレ
ネル・レンズの一部分にし、レンズの厚さを薄くするこ
ともできる。また、円柱レンズまたは円柱レンズの一部
分にすることもできる。ただし、中国語や韓国語などの
外国語の場合には、単語の形状は通常は正方形になる。
したがって、中国語または韓国語の電子辞書で使用する
レンズの形状は長方形ではなく円形にすることもある。The lens 18 can be a circular lens. However, in an electronic dictionary, an object whose image is to be captured by the device is usually in a rectangular shape. This is because in English and other Western languages, each word is generated from several alphabets. To put the alphabet together, they are usually arranged from left to right, resulting in words in the shape of a rectangle. Therefore, it is not necessary to make the shape of the lens 18 circular in order to capture an optical image of a rectangular target. Since the image capture process of the present invention involves a substantially rectangular shaped object, it can be part of a circular lens. Alternatively, the lens may be a Fresnel lens or a part of a Fresnel lens, and the thickness of the lens may be reduced. It can also be a cylindrical lens or a part of a cylindrical lens. However, in the case of a foreign language such as Chinese or Korean, the word shape is usually a square.
Therefore, the shape of the lens used in the Chinese or Korean electronic dictionary may be circular instead of rectangular.

【００２２】図２は、現在好ましい実施形態の代替実施
形態を示す斜視図である。この図にはコンピュータのマ
ウスに似た、対象を取り込み、対象に関係する情報を表
示する第２装置４４が示されている。図１で参照した装
置１０とこの第２装置４４とを区別するために、本発明
ではこの第２装置４４をマウス４４と呼ぶ。FIG. 2 is a perspective view showing an alternative embodiment of the presently preferred embodiment. This figure shows a second device 44, similar to a computer mouse, that captures an object and displays information related to the object. In order to distinguish the device 10 referred to in FIG. 1 from the second device 44, the second device 44 is referred to as a mouse 44 in the present invention.

【００２３】マウス４４はボディ４６を有する。マウス
４４は前端４８および後端５０を有する。マウス４４
は、頂部５２、底部（図示せず）、および２つの側面５
４、５６を有する。スタンドオフ・キャリア５８は前端
４８に上向きに取り付けられ、マウスのボディ４６の前
端４８に対する所定位置にレンズ（図示せず）を位置決
めする。スタンドオフ・キャリア５８は、上向きライザ
６２部分およびプラットフォーム６４部分を含む。上向
きライザ６２はボディ４６の前端４８に回転可能に取り
付けられ、プラットフォーム６４をボディ４６の上方の
所定高さまで上昇させるようになっている。プラットフ
ォーム６４は上側表面６６および下側表面６８を含む。
レンズは、その光学軸がスタンドオフ・キャリア５８の
上向きライザ６２と平行になるように、下側表面６８に
取り付けられる。複数の高い側面７０に取り囲まれた上
側表面６６および下側表面６８は、その中にハウジング
（図示せず）を区画する。上側表面６６と下側表面６８
の間の距離は、レンズの奥にＣＣＤベース・カメラを置
く十分なスペースを提供する。The mouse 44 has a body 46. Mouse 44 has a front end 48 and a rear end 50. Mouse 44
Has a top 52, a bottom (not shown), and two sides 5
4, 56. A standoff carrier 58 is mounted upwardly at the front end 48 and positions a lens (not shown) in position with respect to the front end 48 of the body 46 of the mouse. Standoff carrier 58 includes an upward riser 62 portion and a platform 64 portion. The upward riser 62 is rotatably attached to the front end 48 of the body 46 so as to raise the platform 64 to a predetermined height above the body 46. Platform 64 includes an upper surface 66 and a lower surface 68.
The lens is mounted on the lower surface 68 such that its optical axis is parallel to the upward riser 62 of the standoff carrier 58. An upper surface 66 and a lower surface 68 surrounded by a plurality of tall sides 70 define a housing (not shown) therein. Upper surface 66 and lower surface 68
The distance between provides sufficient space to place the CCD-based camera behind the lens.

【００２４】マウス４４の前端４８の底部付近で、ガイ
ド部材７２が前端４８から延びている。ガイド部材７２
は、その中に開口７４を形成している長方形の形状をし
ている。開口７４の形状もほぼ長方形である。ガイド部
材７２は、レンズの光学軸が開口７４を通過するように
位置決めされる。Near the bottom of the front end 48 of the mouse 44, a guide member 72 extends from the front end 48. Guide member 72
Has a rectangular shape with an opening 74 formed therein. The shape of the opening 74 is also substantially rectangular. The guide member 72 is positioned so that the optical axis of the lens passes through the opening 74.

【００２５】好ましい実施形態では、プラットフォーム
６４は軸６９を介して上向きライザ６２に回転可能に固
定される。プラットフォーム６４を回転させ、プラット
フォーム６４を下向きに折りたたんで上向きライザ６２
と平行にすることもできる。上向きライザ６２は、軸
（図示せず）でボディ４６に回転可能に取り付けられて
いる。上向きライザ６２およびプラットフォーム６４を
含むスタンドオフ・キャリア５８は、上向きライザ６２
の前にこれと並べて、下向きに折りたたむことができ
る。その後スタンドオフ・キャリア５８をボディ４６中
に滑らせて入れることができる。ガイド部材７２は滑動
可能キャリア（図示せず）上に取り付けられ、マウス４
４のボディ４６中に引き込むことができる。この特徴の
詳細な機構は同時係属の特許出願に記載されている。In the preferred embodiment, platform 64 is rotatably secured to upward riser 62 via a shaft 69. The platform 64 is rotated, and the platform 64 is folded downward so that the upward riser 62
Can also be parallel. The upward riser 62 is rotatably attached to the body 46 by a shaft (not shown). The standoff carrier 58, which includes the upward riser 62 and the platform 64,
You can fold it down side by side with this before. The standoff carrier 58 can then be slid into the body 46. The guide member 72 is mounted on a slidable carrier (not shown),
4 can be retracted into the body 46. The detailed mechanism of this feature is described in the co-pending patent application.

【００２６】ディスプレイ７６はボディ４６の頂部５２
上に位置決めされる。現在好ましい実施形態では、ディ
スプレイはＬＣＤディスプレイである。複数のスイッチ
７８、８０は、ボディ４６の頂部５２上に配置される。
別法として、スイッチ８２をボディ４６の側面５４、５
６の片方または両方に配置することもできる。第１スイ
ッチ７８は、マウス４４中のレンズおよびＣＣＤベース
・カメラを介して対象のスナップショット画像を撮るた
めのものである。その他のスイッチ８０、８２は、上述
のその他の適用業務に使用することができる。マウス４
４の底部からアクセス可能なバッテリを格納することが
できる部屋がボディ４６中にある。このバッテリ部屋
は、バッテリを正しい位置で保持するスナップ留めタイ
プのカバーを有する。The display 76 is located on the top 52 of the body 46.
Is positioned on top. In a currently preferred embodiment, the display is an LCD display. A plurality of switches 78, 80 are located on top 52 of body 46.
Alternatively, switch 82 may be connected to sides 54, 5
6 or one or both. The first switch 78 is for taking a snapshot image of the object via the lens in the mouse 44 and the CCD-based camera. Other switches 80, 82 can be used for the other applications described above. Mouse 4
There is a room in body 46 that can store a battery accessible from the bottom of 4. The battery compartment has a snap-on type cover that holds the battery in place.

【００２７】図１の画像取込み／メッセージ表示装置１
０で説明したレンズと同様に、マウス型装置４４で使用
するレンズもいくつかの形状で作成することができる。
これは、円形レンズ、円形レンズの一部分、フレネル・
レンズ、フレネル・レンズの一部分、円柱レンズ、また
は円柱レンズの一部分にすることができる。Image capture / message display device 1 of FIG.
Similar to the lens described above, the lens used in the mouse-type device 44 can be made in several shapes.
This is a circular lens, part of a circular lens, Fresnel
It can be a lens, a part of a Fresnel lens, a cylindrical lens, or a part of a cylindrical lens.

【００２８】図３は、対象８４の画像を取り込み、対象
８４に関係する情報をディスプレイ２８上に表示する装
置１０またはマウス４４の好ましい実施形態のブロック
図である。装置１０およびマウス４４の内部の機能アー
キテクチャおよび動作は同一であるので、以下の記述は
装置１０に関連する説明であるが、それらはマウス４４
にも等しく適用することができる。装置１０は、レンズ
１８（またはマウス４４では６２）および対象８４の光
学画像を感知する２次元画像センサ８８を有するカメラ
８６と、アナログ／デジタル（Ａ／Ｄ）変換器９０と、
デジタル信号プロセッサ（ＤＳＰ）９２と、読取り専用
メモリ（ＲＯＭ）９４と、ランダム・アクセス・メモリ
（ＲＡＭ）９６と、ディスプレイ２８と、スイッチ３４
−４０とを含む。FIG. 3 is a block diagram of a preferred embodiment of device 10 or mouse 44 for capturing an image of object 84 and displaying information related to object 84 on display 28. Since the functional architecture and operation inside the device 10 and the mouse 44 are the same, the following description is a description related to the device
Can be applied equally. The device 10 includes a camera 86 having a lens 18 (or 62 for the mouse 44) and a two-dimensional image sensor 88 for sensing an optical image of the object 84, an analog / digital (A / D) converter 90,
Digital signal processor (DSP) 92, read only memory (ROM) 94, random access memory (RAM) 96, display 28, switch 34
-40.

【００２９】２次元画像センサ８８はレンズ１８の奥に
配置される。２次元画像センサ８８はレンズ１６を通過
した対象８４の光学画像信号を受信し、この光学画像信
号をアナログ画像信号に変形し、このアナログ画像信号
をアナログ／デジタル（Ａ／Ｄ）変換器９０に転送す
る。Ａ／Ｄ変換器９０はアナログ画像信号からデジタル
画像信号を生成し、このデジタル画像信号をデジタル信
号プロセッサ（ＤＳＰ）９２に送信する。スイッチ３４
はＤＳＰ９２に結合され、装置１０の横向き画像幅ガイ
ド２４（図１参照）の上、またはガイド部材７２の開口
７４（図２参照）中にある対象８４のスナップショット
静止画像を撮るために、ユーザによって制御される。Ｄ
ＳＰ９２はＲＯＭ９４およびＲＡＭ９６に結合される。
ＲＯＭ９４はＤＳＰ９２を制御する命令を含む。ＲＯＭ
９４には、対象８４に関係する全ての情報を含むデータ
ベース、または装置１０が実行する文字認識動作のため
のテンプレート・ライブラリも含まれる。ＲＯＭ９４
は、装置１０の電源をオフにする前に最新の情報を保存
することができるように、フラッシュ・メモリの形態に
することもできる。ＲＡＭ９６は主にＤＳＰ９２によっ
て制御される装置動作のための汎用メモリである。ＤＳ
Ｐ９２は２次元画像センサ８８に電気的に結合され、２
次元画像センサ８８の動作を制御する。The two-dimensional image sensor 88 is arranged behind the lens 18. The two-dimensional image sensor 88 receives the optical image signal of the object 84 passing through the lens 16, transforms the optical image signal into an analog image signal, and converts the analog image signal into an analog / digital (A / D) converter 90. Forward. The A / D converter 90 generates a digital image signal from the analog image signal, and transmits the digital image signal to a digital signal processor (DSP) 92. Switch 34
Is coupled to the DSP 92 and allows the user to take a snapshot still image of the object 84 on the landscape image width guide 24 of the device 10 (see FIG. 1) or in the opening 74 of the guide member 72 (see FIG. 2). Is controlled by D
SP 92 is coupled to ROM 94 and RAM 96.
ROM 94 contains instructions for controlling DSP 92. ROM
94 also includes a database containing all information related to the object 84, or a template library for character recognition operations performed by the device 10. ROM94
Can be in the form of flash memory so that the latest information can be stored before the device 10 is powered off. The RAM 96 is a general-purpose memory mainly for the device operation controlled by the DSP 92. DS
P92 is electrically coupled to the two-dimensional image sensor 88,
The operation of the three-dimensional image sensor 88 is controlled.

【００３０】マイクロフォン、スピーカ、および通信手
段もこの装置に含めることができ、これら全てはＤＳＰ
９２に結合され、音声認識、音声合成、または通信機能
を提供する。マイクロフォン、スピーカ、通信、または
装置のオン／オフを制御するために、追加のスイッチを
ＤＳＰ９２に結合することもできる。Microphones, speakers and communication means can also be included in this device, all of which are DSP
92, and provides speech recognition, speech synthesis, or communication functions. Additional switches may be coupled to the DSP 92 to control the on / off of the microphone, speaker, communication, or device.

【００３１】好ましい実施形態では、装置１０は対象８
４の画像を取り込む。対象８４の画像を取り込むため
に、レンズ１８は、対象８４から距離１００だけ離して
配置しなければならない。カメラ８６が対象８４の鮮明
な画像を取り込むためには、距離１００は、ほぼレンズ
１８の焦点距離以内でなければならない。ペン型装置１
０の場合には、この距離はスタンドオフ・アーム２０の
長さによって計算され、予め決定される。マウス型装置
４４の場合には、距離１００は、レンズとガイド部材７
２の開口７４中に配置された対象との間の距離、すなわ
ちほぼ上向きライザ６２の長さになるはずである。In a preferred embodiment, the device 10
4 is captured. In order to capture an image of the object 84, the lens 18 must be placed at a distance 100 from the object 84. Distance 100 must be approximately within the focal length of lens 18 for camera 86 to capture a sharp image of object 84. Pen type device 1
If zero, this distance is calculated and predetermined by the length of the standoff arm 20. In the case of the mouse-type device 44, the distance 100 is the distance between the lens and the guide member 7.
It should be the distance to the object located in the two openings 74, ie approximately the length of the upward riser 62.

【００３２】対象８４は、文書中の単語などのテキスト
形式になることがある。画像取込み装置１０は、開口７
４内、または横向き画像幅ガイド２４の上にある対象の
画像を取り込む。例えば、電子辞書では、現在好ましい
実施形態の画像取込み装置１０は、ちょうどカメラで写
真を撮る場合と同様に、単一動作で単語の画像を取り込
む。従来技術では、単語の画像は通常は、文書のページ
または行の一部分として走査され、コンピュータ中に入
れられる。従来技術のスキャナは、文書の同じ行にある
その他の単語の中から１つの単語を選択的に操作するこ
とはできない。したがって、電子辞書のユーザは、スキ
ャナを入力装置として利用して単語の定義を取り出すこ
とはできない。同様に、ＰＩＭユーザは、従来のスキャ
ナを介してカレンダーの日付を走査することによって、
彼または彼女の特定の日の予約を照会することはできな
い。本発明の画像取込み装置１０は、単一の単語、いく
つかの単語、または単語の長さによってその単語の一部
分の画像を取り込むことができる。これにより、この装
置では、小さな対象または１つの項目の小さな一部分の
画像を取り込む際にさらに融通がきくようになる。The object 84 may be in the form of a text such as a word in a document. The image capturing device 10 has an opening 7
4 or an image of the object located on the horizontal image width guide 24. For example, in an electronic dictionary, the image capture device 10 of the presently preferred embodiment captures an image of a word in a single operation, just as if taking a picture with a camera. In the prior art, an image of a word is typically scanned as part of a page or line of a document and placed into a computer. Prior art scanners cannot selectively manipulate one word among other words on the same line of the document. Therefore, the user of the electronic dictionary cannot retrieve the definition of a word using the scanner as an input device. Similarly, a PIM user scans a calendar date via a conventional scanner,
You cannot query his or her reservation for a particular day. The image capture device 10 of the present invention can capture an image of a single word, several words, or a portion of the word by word length. This makes the apparatus more flexible in capturing images of small objects or small portions of an item.

【００３３】対象の光画像はレンズを通過して、２次元
画像センサ８８上に投影される。好ましい実施形態で
は、２次元画像センサ８８は、例えば、対象の光学画像
信号を２次元指向アナログ画像信号に変換する電荷結合
素子（ＣＣＤ）光学画像センサ・モジュールを含む。こ
の変換プロセスは当技術分野では周知である。次いでこ
のアナログ画像信号が、アナログ／デジタル変換器９０
でデジタル画像信号に変換される。次いでデジタル画像
信号がＤＳＰ９２で処理され、対象の画像を表示するデ
ィスプレイ２８に送信される。これらのプロセスは全て
実時間で実行され、したがって、ディスプレイ上に示さ
れる対象の画像はレンズの範囲内にある対象の画像を反
映する。ユーザは、装置が取り込んだ対象の画像をディ
スプレイ上で実時間で見ることができる。ユーザは、彼
または彼女がさらに情報を望む対象をディスプレイ上で
見た後で、スイッチ３４を押して単純にその対象のスナ
ップショット静止画像を撮る。ＤＳＰ９２はスイッチ３
４が押されたことを検出し、その画像が表示されている
対象が選択されたこと、およびその対象に関する情報を
取り出して表示すべきであることを認識する。The light image of the object passes through the lens and is projected onto the two-dimensional image sensor 88. In a preferred embodiment, the two-dimensional image sensor 88 includes, for example, a charge-coupled device (CCD) optical image sensor module that converts an optical image signal of interest into a two-dimensional directional analog image signal. This conversion process is well known in the art. Next, the analog image signal is supplied to the analog / digital converter 90.
Is converted into a digital image signal. The digital image signal is then processed by the DSP 92 and sent to a display 28 that displays the image of interest. All of these processes are performed in real time, so the image of the object shown on the display reflects the image of the object within range of the lens. The user can view the image of the object captured by the device on the display in real time. After the user sees the object on which he or she wants more information on the display, he simply presses switch 34 to take a snapshot still image of the object. DSP92 is switch 3
It is detected that 4 has been pressed, and it is recognized that the target on which the image is displayed is selected, and that information on the target should be retrieved and displayed.

【００３４】別の実施形態では、ユーザは、スイッチ３
４を「オン」位置に置くことによって画像取込み動作を
活動化することができる。活動化されると、画像取込み
装置はレンズの範囲内に配置されている対象の画像を取
り込む。この画像取込み装置のユーザは、取り込まれた
対象の画像をディスプレイ上で見ることができる。対象
のスナップショット静止画像は、スイッチを離すと撮ら
れる。In another embodiment, the user has the switch 3
The image capture operation can be activated by placing 4 in the "on" position. When activated, the image capture device captures an image of an object located within the lens. The user of the image capturing device can view the captured image of the target on the display. The snapshot still image of the subject is taken when the switch is released.

【００３５】２次元画像センサ８８では、ＲＯＭ９４中
に組み込まれた画像センサ・ドライバを使用する。好ま
しい実施形態では、画像センサ・ドライバは、ＤＳＰ９
２の動作を制御する命令シーケンスを含むコンピュータ
・プログラムの形態になる。２次元画像センサ８８は、
画像センサ・ドライバの命令に従って光画像信号を対象
のアナログ画像信号に変換する。The two-dimensional image sensor 88 uses an image sensor driver incorporated in the ROM 94. In a preferred embodiment, the image sensor driver is a DSP 9
2 in the form of a computer program including a sequence of instructions for controlling the operation of the computer. The two-dimensional image sensor 88
The optical image signal is converted into a target analog image signal according to an instruction of the image sensor driver.

【００３６】好ましい実施形態のＤＳＰ９２は、マイク
ロコンピュータ、マイクロ制御装置、ＣＩＳＣプロセッ
サ、またはＲＩＳＣプロセッサにすることもできる。プ
ロセッサのタイプは画像取込み装置１０の動作の結果に
影響を及ぼさない。ただし、その性能に影響を及ぼす可
能性はある。デジタル信号プロセッサは、多数を処理す
ることができることで知られている。信号処理技術では
計算量の多い多項式を大量に使用するので、デジタル信
号プロセッサは、この適用業務で使用する最良のタイプ
のプロセッサであると考えられる。The DSP 92 of the preferred embodiment may be a microcomputer, microcontroller, CISC processor, or RISC processor. The type of processor does not affect the results of operation of the image capture device 10. However, its performance may be affected. Digital signal processors are known for being able to process many. Due to the large amount of computationally intensive polynomials used in signal processing techniques, digital signal processors are considered to be the best type of processor to use in this application.

【００３７】図４は、シフト・ゲート１０６を介してＣ
ＣＤアナログ・シフト・レジスタ１０４に接続された画
像感知回路１０２を含む２次元画像センサ８８の概略図
である。画像感知回路１０２は、各素子が並列に結合さ
れた光ダイオード１０８およびコンデンサ１１０を含む
２次元マトリックス形式に配列された複数の感光素子
（フォトサイト（ｐｈｏｔｏｓｉｔｅｓ）と呼ばれる）
を含む。画像感知回路１０２の一端は画像取込み装置１
０のシャーシ・グラウンド１１２に接続され、画像感知
回路１０２のもう一端はシフト・ゲート１０６の片側に
結合される。シフト・ゲート１０６の反対側はＣＣＤア
ナログ・シフト・レジスタ１０４に結合される。シフト
・ゲート１０６を閉じると、ターゲットになった対象か
ら反射された光が光ダイオード１０８の状態を変化さ
せ、連続した光信号を生成する。連続した光信号はＣＣ
Ｄアナログ・シフト・レジスタ１０４にクロックインさ
れる。ＣＣＤアナログ・シフト・レジスタ１０４を介し
て、アナログ画像信号が生成される。FIG. 4 shows that C
FIG. 2 is a schematic diagram of a two-dimensional image sensor 88 including an image sensing circuit 102 connected to a CD analog shift register 104. The image sensing circuit 102 includes a plurality of photosensitive elements (referred to as photosites) arranged in a two-dimensional matrix including a photodiode 108 and a capacitor 110, each element coupled in parallel.
including. One end of the image sensing circuit 102 is the image capturing device 1
0, and the other end of the image sensing circuit 102 is coupled to one side of the shift gate 106. The other side of shift gate 106 is coupled to CCD analog shift register 104. When the shift gate 106 is closed, light reflected from the targeted object changes the state of the photodiode 108, producing a continuous optical signal. Continuous optical signal is CC
Clocked into D analog shift register 104. An analog image signal is generated via the CCD analog shift register 104.

【００３８】別法として、ＣＣＤアナログ・シフト・レ
ジスタの代わりにＣＭＯＳベース・アナログ・シフト・
レジスタを使用し、光ダイオード１０８の代わりにフォ
ト・レジスタを使用することもできる。こうした代用を
行うと、ＣＣＤベース２次元画像センサ８８はＣＭＯＳ
ベース画像センサとなる。Alternatively, a CMOS based analog shift register may be used instead of a CCD analog shift register.
A resistor may be used, and a photo resistor may be used instead of the photodiode 108. When such a substitution is made, the CCD-based two-dimensional image sensor 88 becomes CMOS-based.
It becomes a base image sensor.

【００３９】次に図５を参照すると、好ましい実施形態
の装置１０が実行する画像取込み動作が示されている。
ユーザが画像取込み動作を活動化することが不可欠であ
るが、そのためにはユーザがカメラのスイッチを押して
画像取込み動作を活動化する必要がある（段階１１
４）。対象の光学画像はレンズ１８を通過し、カメラ８
６で検出される。光学画像信号は２次元アナログ画像信
号に変形される（段階１１６）。次いで、アナログ画像
信号はアナログ／デジタル変換器でデジタル画像信号に
変換される（段階１１８）。事前処理動作で、ＤＳＰ
は、フィルタリング・サブプロセスを介して受信したデ
ジタル画像信号中のノイズを除去し（段階１２０）、フ
ィルタ済み画像信号を生成する。Referring now to FIG. 5, the image capture operation performed by the preferred embodiment apparatus 10 is shown.
It is essential that the user activates the image capture operation, which requires the user to activate the image capture operation by pressing a switch on the camera (step 11).
4). The optical image of the object passes through the lens 18 and the camera 8
6 is detected. The optical image signal is transformed into a two-dimensional analog image signal (step 116). Next, the analog image signal is converted into a digital image signal by an analog / digital converter (step 118). In the pre-processing operation, DSP
Removes noise in the digital image signal received via the filtering sub-process (step 120) and generates a filtered image signal.

【００４０】次にＤＳＰは光学画像認識サブプロセスを
実行する。電子辞書では、本発明の認識動作は新しい光
学文字認識サブプロセス（段階１２４）であり、その詳
細な動作を図８に示す。ＤＳＰは次に、認識した単語を
データベース中の単語と突き合わせる、文字または単語
突合せサブプロセスを実行する（段階１２６）。好まし
い実施形態のこの突合せサブプロセスを、以下の図９に
示す。ＤＳＰ９２は、データベース中で一致した単語を
発見した後で、データベースからその単語に関係する情
報を取り出す（段階１２８）。取り出された情報および
データベース中で発見された単語が、次いで装置１０の
ディスプレイ２８上に表示される（段階１３０）。Next, the DSP performs an optical image recognition sub-process. In the electronic dictionary, the recognition operation of the present invention is a new optical character recognition sub-process (step 124), the detailed operation of which is shown in FIG. The DSP then performs a character or word matching sub-process that matches the recognized words with words in the database (step 126). This matching sub-process of the preferred embodiment is illustrated in FIG. 9 below. After finding a matching word in the database, DSP 92 retrieves information related to the word from the database (step 128). The retrieved information and the words found in the database are then displayed on display 28 of device 10 (step 130).

【００４１】当業者には周知の通り、好ましい実施形態
は、個人がある角度でペンを持つように快適にこの装置
を持つことができるように、装置と対象の間の角度の許
容範囲を大きくすることができる。さらに、好ましい実
施形態では、画像の座標を回転させることができるよう
にする新しいカメラの較正機能を実施することもでき、
この機能により、ユーザはさらに自由に対象に対するカ
メラの角度を決めることができるようになる。As is well known to those skilled in the art, the preferred embodiment increases the angular tolerance between the device and the object so that the individual can comfortably hold the device as if holding the pen at an angle. can do. Further, the preferred embodiment can also implement a new camera calibration function that allows the image coordinates to be rotated,
With this function, the user can further freely determine the angle of the camera with respect to the object.

【００４２】図６は、好ましい実施形態の光学文字認識
サブプロセスを示す流れ図である。このサブプロセス
は、光信号を表すデータ点を含み、ＸおよびＹ座標で表
されるマトリックスに配列された、フィルタ済みデジタ
ル画像信号のマトリックスを受信する段階から始まる
（段階１３２）。マトリックスの各素子は、Ｐ_xy（ｘ、
ｙ）の画素のグレースケール値を表す。（ｘ、ｙ）はマ
トリックス中の画素Ｐ_xyの位置である。FIG. 6 is a flowchart illustrating the optical character recognition sub-process of the preferred embodiment. The sub-process begins with receiving a matrix of filtered digital image signals, including data points representing the optical signal, arranged in a matrix represented by X and Y coordinates (step 132). Each element of the matrix is P _xy (x,
y) represents the gray scale value of the pixel. (X, y) is the position of the pixel _{Pxy in} the matrix.

【００４３】次に、グレースケール・フィルタ済みデジ
タル画像信号から、２値グレー形式の画像が生成される
（段階１３４）。このプロセスの基本動作は、グレース
ケール形式またはカラースケールの画像信号を、２値グ
レー形式に変換することである。この動作は、多様なグ
レーの陰影の画素を白または黒（あるいは０または１）
の画素に変化させる。すなわち画素は、黒いドットまた
はドットなしの画素のいずれかになる。様々なアルゴリ
ズムを使用して、どのようにしてグレースケールのデー
タを２値データに変形するかを決定することができる。
２値データを生成した後で、フィルタリング・プロセス
が実行され、画像中のドット・ノイズが除去される（段
階１３６）。このフィルタリング・サブプロセスの結果
として、より良好かつ鮮明な画像データ・マトリックス
が得られる。Next, an image in binary gray format is generated from the grayscale filtered digital image signal (step 134). The basic operation of this process is to convert a grayscale or color scale image signal into a binary gray format. This operation will cause the various gray shaded pixels to be white or black (or 0 or 1)
Pixel. That is, the pixels are either black dots or pixels without dots. Various algorithms can be used to determine how to transform the grayscale data into binary data.
After generating the binary data, a filtering process is performed to remove dot noise in the image (step 136). The result of this filtering sub-process is a better and sharper image data matrix.

【００４４】次の段階は、対象の画像ブロックを生成す
る段階である。画像ブロックを生成する前に、まず対象
の画像をサブパートに分割しなければならない。電子辞
書では、単語フレーミング・サブプロセスが、取り込ん
だ各単語の画像について実行される。単語フレーミング
・サブプロセスは、最初に単語の前後のスペースを検出
することによって実施される。単語中の各文字もまた単
語フレーミング・プロセスでフレーミングされ、各文字
は各文字間の切れ目を認識することによってフレーミン
グされる。これらのプロセスでは、単語フレーミングま
たは文字フレーミング・プロセスを介して、フィルタ済
み画像信号から単語画像ブロックまたは文字画像ブロッ
クを生成する（段階１３８）。この段階では、画像デー
タ・マトリックスからそれらの点の密度に従って複数の
画像ブロックを生成する。電子辞書では、これは単語中
の各文字を分割することを必要とするが、これは各文字
間のスペースを検出することによって達成することがで
きる。このプロセスの目的は、以下のプロセスでより容
易に認識することができる文字を得ることである。文字
画像ブロックから、ループ認識サブプロセスによって文
字を認識することができる（段階１４０）。基本的に、
このループ認識サブプロセスは、それ以前のサブプロセ
スで再生された各文字を認識し、各文字を正しい順序で
配置し、画像取込み装置が取り込んだ単語を再構成する
ことによって実行される。この段階のある態様をさらに
図７および図８に示す。それ以前のサブプロセス（段階
１３２から１４０）は、単語中の全ての文字が文字画像
ブロック中に置かれるまで続行する（段階１４２）。The next step is to generate a target image block. Before generating an image block, the target image must first be divided into subparts. In an electronic dictionary, a word framing subprocess is performed on each captured word image. The word framing sub-process is implemented by first detecting spaces before and after the word. Each character in the word is also framed in the word framing process, and each character is framed by recognizing the breaks between each character. These processes generate word or character image blocks from the filtered image signal via a word or character framing process (step 138). At this stage, a plurality of image blocks are generated from the image data matrix according to the density of those points. In electronic dictionaries, this requires breaking each character in a word, but this can be achieved by detecting the space between each character. The purpose of this process is to obtain characters that can be more easily recognized in the following process. Characters can be recognized from the character image block by a loop recognition sub-process (step 140). fundamentally,
This loop recognition sub-process is performed by recognizing each character reproduced in the previous sub-process, placing each character in the correct order, and reconstructing the words captured by the image capture device. Some aspects of this stage are further illustrated in FIGS. The previous sub-process (steps 132 to 140) continues until all the characters in the word have been placed in the character image block (step 142).

【００４５】ループ認識サブプロセスを介して文字を突
き合わせる上記の動作を図７にさらに示す。例として文
字「Ｚ」を使用すると、最初の段階はいくつかの特徴画
素を識別する段階である。この例では、ＣＰ−１１４
４、ＣＰ−２１４６、ＣＰ−３１４８、およびＣＰ
−４１５０の４つの特徴画素が識別された。次に、各
特徴画素のその他の特徴画素に対する相対位置が決定さ
れる。例えば、ＣＰ−２１４６はＣＰ−１１４４の
右側にあり、ＣＰ−３１４８はＣＰ−１１４４の下
にあり、ＣＰ−４１５０はＣＰ−１１４４の右下に
ある。次の段階は、文字中の各特徴画素で形成される角
度を識別する段階である。例えば、ＣＰ−３１４８の
角度はＣＰ−２１４６およびＣＰ−３１４８をつな
ぐ線とＣＰ−３１４８およびＣＰ−４１５０をつな
ぐ線との間の角度である。この例では、この角度は約６
０度である。次の段階は、２つの特徴画素の間の距離を
決定する段階である。この距離は１つの所定距離に対す
る比で表すことができる。この比は、様々なフォント・
サイズの文字をサポートするために好ましい。特徴画素
の相対関係が分かった後で、その文字の特徴とホスト・
システムのテンプレート・ライブラリに事前に記憶され
た文字の特徴とを比較して、突合せプロセスを実行す
る。テンプレート・ライブラリはデータベースの別形態
である。The above operation of matching characters through the loop recognition subprocess is further illustrated in FIG. Using the letter "Z" as an example, the first step is to identify some feature pixels. In this example, CP-1 14
4, CP-2 146, CP-3 148, and CP
-4 150 four feature pixels were identified. Next, the relative position of each characteristic pixel with respect to the other characteristic pixels is determined. For example, CP-2 146 is to the right of CP-1 144, CP-3 148 is below CP-1 144, and CP-4 150 is to the lower right of CP-1 144. The next step is to identify the angle formed by each feature pixel in the character. For example, the angle of CP-3 148 is the angle between the line connecting CP-2 146 and CP-3 148 and the line connecting CP-3 148 and CP-4 150. In this example, this angle is about 6
0 degrees. The next step is to determine the distance between the two feature pixels. This distance can be expressed as a ratio to one predetermined distance. This ratio is different for various fonts
Preferred to support size characters. After the relative relationship between the characteristic pixels is known, the character
The matching process is performed by comparing the character features previously stored in the system's template library. Template libraries are another form of database.

【００４６】図８を参照すると、図６に段階１４０で示
される文字ループ認識サブプロセスの流れ図が示されて
いる。電子辞書のループ認識サブプロセスでは、最初に
取り込まれた文字画像の字画を識別する（段階１５
２）。例として図７の文字「Ｚ」を使用すると、第１字
画がＣＰ１１４４からＣＰ２１４６、第２字画がＣ
Ｐ２１４６からＣＰ３１４８、第３字画がＣＰ３
１４８からＣＰ４１５０となる３つの字画を識別する
ことができる。次の段階は、各字画の相対長さを決定す
る段階である（段階１５４）。文字「Ｚ」では、この段
階で、各字画の相対長さ、すなわちＣＰ１１４４とＣ
Ｐ２１４６の間の距離となる第１字画の相対長さ、Ｃ
Ｐ２１４６とＣＰ３１４８の間の距離となる第２字
画の相対長さ、ＣＰ３１４８とＣＰ４１５０の間の
距離となる第１字画の相対長さを決定する。次の段階
は、画像取込み装置によって取り込まれた文字画像の画
数を決定する段階である（段階１５６）。文字「Ｚ」で
は、画数は３となり、これらはそれぞれ上記で第１、第
２、および第３として識別したものである。次の段階
は、つながった２つの字画の間の角度を決定する段階で
ある（段階１５８）。上記と同様に、特徴画素ＣＰ３
１４８での角度は第２字画と第３字画の間の角度であ
る。この例では、第２字画および第３字画はつながった
２つの字画である。同様に、上記で識別した第１字画お
よび第２字画も、文字「Ｚ」中のつながった２つの字画
である。つながった２つの字画、すなわち第１字画およ
び第２字画は、特徴画素ＣＰ２１４６で、やはりこの
段階で決定される角度を形成する。次の段階は各字画の
向きを決定する段階である（段階１６０）。この場合も
例として文字「Ｚ」を使用すると、第１字画は通常は左
から右に向かって書かれ、第２字画は通常は右上から左
下に向かって書かれ、第３字画は通常は左から右に向か
って書かれる。次に、上記段階で決定された情報を含む
取り込まれた文字画像の特徴情報を、字画特徴から生み
出す（段階１６２）。特徴情報は、各字画の相対長さ、
文字画像の画数、つながった２つの字画の間に形成され
る角度、および各字画の向きを含む。次いで取り込まれ
た文字画像の特徴情報は、テンプレート・ライブラリ中
の文字の特徴と比較される（段階１６４）。テンプレー
ト・ライブラリは、認識すべき明確な図形記号のそれぞ
れに対するプロトタイプ画像をコンピュータ中に含むデ
ータベースである。取り込まれた文字画像の特徴情報と
テンプレート・ライブラリ中の文字の特徴とを比較した
後で、完全な突合せが発見されたかどうかを決定するこ
とが不可欠である（段階１６６）。完全な一致が発見さ
れた場合には、突合せサブプロセスの結果として発見さ
れた文字を参照文字として使用することになる（段階１
６８）。完全な突合せが発見されなかった場合には、取
り込まれた文字画像の特徴情報に最も近い特徴を有する
文字を参照文字として選択する（段階１７０）。Referring to FIG. 8, a flow chart of the character loop recognition sub-process shown at step 140 is shown in FIG. In the electronic dictionary loop recognition sub-process, the stroke of the first captured character image is identified (step 15).
2). Using the character "Z" in FIG. 7 as an example, the first stroke is CP1 144 to CP2 146, and the second stroke is C.
From P2 146 to CP3 148, the third stroke is CP3
Three strokes from 148 to CP4 150 can be identified. The next step is to determine the relative length of each stroke (step 154). In the character “Z”, at this stage, the relative length of each stroke, ie, CP1 144 and C
The relative length of the first stroke, which is the distance between P2 146, C
The relative length of the second stroke, which is the distance between P2 146 and CP3 148, and the relative length of the first stroke, which is the distance between CP3 148 and CP4 150, are determined. The next step is to determine the number of strokes of the character image captured by the image capture device (step 156). For the letter "Z", the number of strokes is three, identified above as first, second, and third, respectively. The next step is to determine the angle between the two connected strokes (step 158). As described above, the characteristic pixel CP3
The angle at 148 is the angle between the second stroke and the third stroke. In this example, the second stroke and the third stroke are two connected strokes. Similarly, the first stroke and the second stroke identified above are also two connected strokes in the character “Z”. The two connected strokes, the first stroke and the second stroke, form at the feature pixel CP2 146 the angle also determined at this stage. The next step is to determine the orientation of each stroke (step 160). Again, using the letter "Z" as an example, the first stroke is usually written from left to right, the second stroke is usually written from upper right to lower left, and the third stroke is usually written left to right. From right to right. Next, feature information of the captured character image including the information determined in the above step is generated from the stroke feature (step 162). The feature information is the relative length of each stroke,
It includes the number of strokes of a character image, the angle formed between two connected strokes, and the orientation of each stroke. The captured character image feature information is then compared to the character features in the template library (step 164). The template library is a database that contains in the computer a prototype image for each distinct graphic symbol to be recognized. After comparing the feature information of the captured character image with the features of the characters in the template library, it is essential to determine whether a perfect match has been found (step 166). If an exact match is found, the character found as a result of the matching sub-process will be used as the reference character (step 1).
68). If a perfect match is not found, a character having a characteristic closest to the characteristic information of the captured character image is selected as a reference character (step 170).

【００４７】画像取込み装置が取り込んだ単語中の全て
の文字が識別された後、次のサブプロセスは、その単語
に関係する情報をデータベースから取り出す段階であ
る。識別された単語の情報を取り出すサブプロセスを、
図９の流れ図に示す。この単語突合せサブプロセスは、
メモリから認識した単語を取り出す段階から始まる（段
階１７２）。認識した単語を取り出した後、次の手順は
認識した単語と一致する単語をデータベース中で発見す
ることである（段階１７４）。次いで、一致する単語が
データベース中で発見されたか否かを決定する（段階１
７６）。取り込まれた単語がデータベース中で発見され
るか否かについての何らかの情報を提供することが不可
欠である。一致する単語がデータベース中で発見された
場合には、一致する単語に関係する定義および情報がデ
ータベースから取り出される（段階１７８）。そうでな
い場合には、取り込まれた単語に最も近いデータベース
中の単語が選択され、その定義および情報がデータベー
スから取り出される（段階１８０）。データベースから
情報を取り出した後で、単語、その定義、およびそれが
関係する情報が、装置のディスプレイ上に表示される
（段階１８２）。After all the characters in the word captured by the image capture device have been identified, the next sub-process is to retrieve information related to the word from the database. A sub-process to retrieve the information of the identified word,
This is shown in the flowchart of FIG. This word matching subprocess:
Beginning with retrieving the recognized word from memory (step 172). After retrieving the recognized word, the next step is to find a word in the database that matches the recognized word (step 174). It is then determined whether a matching word has been found in the database (step 1).
76). It is essential to provide some information about whether the captured words are found in the database. If a matching word is found in the database, definitions and information relating to the matching word are retrieved from the database (step 178). Otherwise, the word in the database that is closest to the captured word is selected and its definition and information are retrieved from the database (step 180). After retrieving the information from the database, the word, its definition, and the information to which it pertains are displayed on the display of the device (step 182).

【００４８】特定の実施形態に関連して本発明を特に示
し、説明したが、形状および細部の様々な変更および修
正を加えることができることは当業者には理解されるで
あろう。したがって、頭記の特許請求の範囲は、こうし
た全ての変更および修正を、本発明の趣旨および範囲内
にあるものとしてカバーするものとする。While the invention has been particularly shown and described with respect to particular embodiments, those skilled in the art will recognize that various changes and modifications in form and detail may be made. It is, therefore, intended that the appended claims cover all such changes and modifications as fall within the spirit and scope of the invention.

[Brief description of the drawings]

【図１】現在好ましい実施形態の画像取込み／メッセ
ージ表示装置の斜視図である。FIG. 1 is a perspective view of an image capture / message display device of the presently preferred embodiment.

【図２】別の好ましい実施形態の画像取込み／メッセ
ージ表示装置の斜視図である。FIG. 2 is a perspective view of another preferred embodiment image capture / message display device.

【図３】好ましい実施形態の主要な構成部品を示すブ
ロック図である。FIG. 3 is a block diagram showing the main components of the preferred embodiment.

【図４】好ましい実施形態のＣＣＤベース画像センサ
の概略図である。FIG. 4 is a schematic diagram of a CCD-based image sensor of a preferred embodiment.

【図５】好ましい実施形態の画像取込み動作およびメ
ッセージ表示動作の流れ図である。FIG. 5 is a flowchart of an image capturing operation and a message displaying operation of the preferred embodiment.

【図６】好ましい実施形態の画像認識プロセスの流れ
図である。FIG. 6 is a flow chart of the image recognition process of the preferred embodiment.

【図７】好ましい実施形態の突合せ動作における対象
の特徴画素の選択の１例を示す図である。FIG. 7 is a diagram illustrating an example of selection of a target feature pixel in a matching operation according to a preferred embodiment;

【図８】好ましい実施形態のループ認識サブプロセス
の突合せ動作の流れ図である。FIG. 8 is a flowchart of the matching operation of the loop recognition sub-process of the preferred embodiment.

【図９】情報検索動作の流れ図である。FIG. 9 is a flowchart of an information search operation.

[Explanation of symbols]

１０装置１８レンズ２８ディスプレイ３４〜４０スイッチ８４対象８６カメラ８８２次元画像センサ９０アナログ／デジタル（Ａ／Ｄ）変換器９２デジタル信号プロセッサ（ＤＳＰ）９４読取り専用メモリ（ＲＯＭ）９６ランダム・アクセス・メモリ（ＲＡＭ） Reference Signs List 10 device 18 lens 28 display 34-40 switch 84 object 86 camera 88 two-dimensional image sensor 90 analog / digital (A / D) converter 92 digital signal processor (DSP) 94 read-only memory (ROM) 96 random access memory (RAM)

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成１０年８月１８日（１９９８．８．１
８）[Submission date] August 18, 1998 (1998.8.1)
8)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】図面[Document name to be amended] Drawing

【補正対象項目名】全図[Correction target item name] All figures

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図１】 FIG.

【図２】 FIG. 2

【図３】 FIG. 3

【図４】 FIG. 4

【図７】 FIG. 7

【図５】 FIG. 5

【図６】 FIG. 6

【図８】 FIG. 8

【図９】 FIG. 9

───────────────────────────────────────────────────── フロントページの続き (72)発明者デイヴィッド・ディ・エス・ホ台湾・タイペイ・シー−リンディストリクト・ホウ−カンストリート・66 (72)発明者フレッド・エイチ・ワイ・チェン台湾・タイペイカウンティ・チャンホシティ・バオ−シエンロード・レーン 106・38番・４ティエイチエフエル (72)発明者サムソン・エックス・エス・ワン中華人民共和国・シャンハイ・ギ−ドンロード・ラン 557・11番・ルーム 202 (72)発明者リン・リー中華人民共和国・サンシプロヴィンス・クシアン・イェンターディストリクト・チンソンロード・７Ｆターム(参考） 5B047 AA01 AA30 AB01 BA03 BB04 BC05 BC15 CB09 CB16 CB22 EA01 EB20 5B050 AA06 BA16 BA20 CA04 DA06 EA07 EA15 EA18 FA02 FA09 FA19 GA04 GA08 5B075 ND07 PP04 PP10 PQ02 PQ32 PR06 QM08 UU01 ──────────────────────────────────────────────────の Continuing on the front page (72) Inventor David D.S.H., Taiwan, Taipei She-Lin District Hou-Khan Street 66 (72) Inventor, Fred H.W.Cheng, Taiwan, Taipei County Chang Ho City Bao-Xien Road Lane 106.38.4 T.F.F. (72) Inventor Samson XS One Shanghai Guidong Road Run 557.11 Room 202 (72) Inventor Lin Lee China, Sansi Province, Cusian, Jenter District, Jinson Road, 7F Term (Reference) 5B047 AA01 AA30 AB01 BA03 BB04 BC05 BC15 CB09 CB16 CB22 EA01 EB20 5B050 AA06 BA1 6 BA20 CA04 DA06 EA07 EA15 EA18 FA02 FA09 FA19 GA04 GA08 5B075 ND07 PP04 PP10 PQ02 PQ32 PR06 QM08 UU01

Claims

[Claims]

1. Capture and display one or more images of one or more selected objects and process the selected images to identify the selected images as specific objects having codes recognizable by the device. An apparatus for identifying, retrieving and displaying information related thereto, comprising a processor, a memory storage device containing a display and information relating to a plurality of identifiable objects, and a switch for taking a snapshot still image of the objects. A camera that generates a signal representing a still image of the object, wherein the camera constantly generates a signal representing the image of the object selected by the user and passes the signal to the processor, and the processor generates the signal. The processor constantly displays the image that it represents in real time, and when the switch is activated to take a still image of the selected object, the processor Identifies an image as a specific object, retrieve information related thereto device.

2. A lens for sensing an optical image of one or more selected objects, receiving an optical image signal of the object via the lens positioned at the back of the lens, and generating an analog image signal of the object. Responsive to an analog image signal of interest from the image sensor.
An analog-to-digital converter for generating a digital image signal of the object, and a processor responsive to the digital image signal of the object generated by the analog-to-digital converter for displaying a digital image of the object on a display. Item 10. The apparatus according to Item 1.

3. The apparatus according to claim 2, wherein the image sensor is a CCD-based image sensor.

4. The apparatus according to claim 2, wherein the image sensor is a CMOS based image sensor.

5. The apparatus according to claim 1, which is a personal information management system.

6. The apparatus according to claim 1, wherein the display is a flat panel display.

7. The device according to claim 1, wherein the display is a liquid crystal display.

8. A method of capturing an image of a subject, displaying said image by a device, identifying it as a particular subject, extracting relevant information and displaying the same on a device, comprising: Capturing a signal and generating a signal representative of the image of the subject; processing the signal to generate an image represented thereby and displaying the image; and activating a user-controllable switch. Generating a still image; identifying the still image as a particular object having a code recognizable by a processor in the device; recognizing the particular object via an image recognition sub-process; Matching the recognized object with an object previously stored in a memory storage device in the device; and Method comprising the steps of retrieving information from the memory storage device of the device, and displaying the information on a device display.

9. A method for generating a grayscale digital image data matrix from a digital image signal of a specific object, and generating a digital image signal of a specific object in a binary gray format from the grayscale digital image signal. 9. The method of claim 8, further comprising: framing a particular object from a digital image signal in the form of a binary gray signal of the particular object; and generating an image block of interest from the frame of interest.

10. A sub-process for matching a recognized object to an object in a database, identifying the matched object, dividing the object into a plurality of sub-parts, wherein each feature pixel is associated with another feature pixel of the sub-part. Determining the position; determining the angle formed by each feature pixel of the subpart; and determining the distance between the two feature pixels of the subpart, wherein the feature depends on the feature of each feature pixel. Identifying a plurality of feature pixels in each subpart; comparing features of the subpart with other subparts in the database; subparts having features matching the subpart of interest;
Or selecting from the database a subpart having features closest to the features of the subpart of interest.

11. The method according to claim 8, wherein the recognition sub-process is an optical character recognition process.

12. Capture and display one or more images of one or more selected objects and process the selected images to convert the selected images as specific words having codes that the device can recognize. An electronic dictionary device for identifying, retrieving and displaying information related thereto, comprising: a processor; a memory storage device including a display and information relating to a plurality of identifiable words; and a switch for taking a snapshot image of the words. A camera that generates a signal representing a still image of the word, the camera constantly generating a signal representing an image of the word selected by the user and passing this signal to the processor, the processor comprising: When the image represented by the signal is constantly displayed in real time and the switch is activated to take a still image of the selected word, the processor There identifying the still picture as a particular word, the electronic dictionary device to retrieve the information related thereto.

13. A lens for sensing an optical image of one or more selected words, and receiving an optical image signal of the word via the lens positioned at the back of the lens to generate an analog image signal of the word. An image sensor operable to respond to an analog image signal of a word from the image sensor.
An analog-to-digital converter for generating a digital image signal of the word, and a processor for displaying a digital image of the word on a display in response to the digital image signal of the word generated by the analog-to-digital converter. Item 13. The electronic dictionary according to item 12.

14. The electronic dictionary according to claim 13, wherein the image sensor is a CCD-based image sensor.

15. The electronic dictionary according to claim 13, wherein the image sensor is a CMOS-based image sensor.

16. A personal information management system according to claim 1.
3. The device according to 2.

17. The apparatus according to claim 12, wherein the display is a flat panel display.

18. The device according to claim 12, wherein the display is a liquid crystal display.

19. A method of capturing an image of a word, displaying the image by an electronic dictionary, identifying the image as a specific word, extracting information related to the word, and displaying the information on the electronic dictionary, wherein the word is constantly displayed. Capturing an optical image signal and generating a signal representing the image of the word; processing the signal to generate an image represented thereby and displaying the image; Generating a still image of the word after being coded, identifying the still image as a particular word having a code recognizable by a processor in the device, and recognizing the particular word via an image recognition sub-process. Matching a recognized word with a word previously stored in a memory storage device in the device; Method comprising the steps of retrieving information related to words from the memory storage device of the device, and displaying the information on the electronic dictionary display.

20. Generating a grayscale digital image data matrix from a digital image signal of a specific word, and generating a digital image signal of a specific word in a binary gray format from the grayscale digital image signal. 20. The method of claim 19, further comprising: framing a particular word from a digital image signal in the form of a binary gray signal of the particular word; and generating an image block of the word from the frame of the word.

21. A sub-process of matching a recognized word to a word in a database and identifying the matched word comprises: dividing the word into a plurality of subparts; Determining the position, determining the angle formed by each feature pixel of the subpart, and determining the distance between the two feature pixels of the subpart, the features depending on the features of each feature pixel. Identifying a plurality of feature pixels in each subpart; comparing features of the subpart with other subparts in the database; subparts having features matching the word subparts;
Or selecting from the database a subpart having features closest to the features of the word subpart.

22. The method according to claim 19, wherein the recognition sub-process is an optical character recognition process.