JP2000348142A

JP2000348142A - Character recognizing device, method therefor and recording medium for recording program executing the method

Info

Publication number: JP2000348142A
Application number: JP11160404A
Authority: JP
Inventors: Mai Araki; 麻衣荒木; Haruhiko Kojima; 治彦児島; Hidekatsu Kuwano; 秀豪桑野
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1999-06-08
Filing date: 1999-06-08
Publication date: 2000-12-15

Abstract

PROBLEM TO BE SOLVED: To realize quick word collation of high precision at the time of recognizing a character string included in a picture of a signboard or the like. SOLUTION: A picture including character information like an enterprise name, a telephone number, or a personal name is inputted (2), and character pictures are segmented from the inputted picture, and segmented character pictures are recognized to output a recognition result character code string (3 to 5). Position information of the place where the picture was inputted is acquired (6), and the range of information in a word dictionary 1 used for word collation is narrowed down on the basis of acquired position information (7). Narrowed-down information in the word dictionary 1 is used to collate words in the word dictionary 1 with the recognition result character code string (8), and a word of which the number of characters coinciding with the recognition result characters is largest as the word collation result is outputted as the character recognition result (9).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は，屋外において入力
された情景画像中の企業名や電話番号，個人名等を認識
するための効果的な単語照合を行う文字認識技術に関す
るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition technology for performing effective word matching for recognizing a company name, a telephone number, a personal name and the like in a scene image input outdoors.

【０００２】[0002]

【従来の技術】看板などの屋外の対象に対しては，高い
文字認識精度が期待できないため，従来の単語照合方法
では，単語照合を行っても一意に絞り込むのが困難であ
った。そのため，例えば，「荒木，宮本，鈴木，中村，
杉村，“複数情報の統合による看板文字認識”，１９９
８信学会ソ大会Ｄ−１２−２５」，特願平１０−２２２
９１３号（文字認識装置）のように，複数文字列の文字
認識結果を統合することによって単語照合を行うものが
提案されているが，単語辞書の規模が大きい場合には，
絞り込みが困難となってしまう場合が生じる。2. Description of the Related Art Since a high character recognition accuracy cannot be expected for an outdoor object such as a signboard, it is difficult to uniquely narrow down the words by the conventional word matching method. Therefore, for example, "Araki, Miyamoto, Suzuki, Nakamura,
Sugimura, "Signboard Character Recognition by Integration of Multiple Information", 199
8 IEICE D-12-25 ", Japanese Patent Application No. 10-222.
No. 913 (character recognition device) has been proposed that performs word matching by integrating the character recognition results of multiple character strings. However, when the size of a word dictionary is large,
In some cases, narrowing down becomes difficult.

【０００３】[0003]

【発明が解決しようとする課題】そのため，低い精度の
文字認識結果を用いた場合の単語照合処理の効果を上げ
るためには，誤認識文字による誤照合文字列を減少させ
るために，単語辞書の絞り込みが必要である。Therefore, in order to improve the effect of the word matching process when using a low-accuracy character recognition result, in order to reduce the number of erroneous matching character strings due to erroneously recognized characters, a word dictionary is required. It is necessary to narrow down.

【０００４】本発明は上記事情に鑑みてなされたもの
で，その目的とするところは，位置情報が含まれる単語
辞書と，画像を取得した場所についての位置情報を利用
することにより，低品質な文字認識結果に対する効果的
な単語照合方法を実現し，精度良く正解単語を得る文字
認識装置を提供することにある。The present invention has been made in view of the above circumstances, and has as its object the use of a word dictionary including position information and position information on a place where an image has been acquired, thereby realizing low quality. An object of the present invention is to provide a character recognition device that realizes an effective word matching method for character recognition results and obtains correct words with high accuracy.

【０００５】[0005]

【課題を解決するための手段】上記の課題を解決するた
めに，本発明は，位置情報の利用に着目し，位置情報と
結び付けられた単語辞書を用いることで単語照合に使用
する単語辞書の範囲を絞り込み，単語照合精度を向上さ
せる。SUMMARY OF THE INVENTION In order to solve the above problems, the present invention focuses on the use of location information, and uses a word dictionary linked to the location information to form a word dictionary used for word matching. Narrow the range and improve word matching accuracy.

【０００６】そのための手段として，企業名や電話番
号，個人名と住所とが対になって格納されている単語辞
書と，企業名や電話番号を含む画像を入力する画像入力
部と，入力された画像から文字列領域を切り出す文字列
切り出し部と，全文字列ついて，一文字ずつの文字画像
領域を求めて文字画像を切り出す文字切り出し部と，切
り出された文字画像の認識を行って認識結果文字コ―ド
列を出力する文字認識部と，画像が入力された場所の位
置情報をＰＨＳ（Personal Handyphone System）やＧＰ
Ｓ(Global Positioning System）を用いて取得する位置
情報取得部と，取得された位置情報を基に単語辞書内の
位置情報で絞り込みを行い，単語照合に使用する単語辞
書の部分を選択する単語辞書選択部と，文字認識部によ
り得られた文字コード列と，単語辞書選択部により得ら
れた単語辞書との間で単語照合を行う単語照合部と，単
語照合部により得られた結果の中でもっとも一致文字数
の多い結果を出力する出力部とから構成されることを特
徴とする。As means for this, a word dictionary in which company names, telephone numbers, personal names and addresses are stored in pairs, and an image input section for inputting images containing company names and telephone numbers, are provided. A character string cutout unit that cuts out a character string area from the extracted image, a character cutout unit that cuts out a character image for each character string in a character image area for all character strings, and a recognition result character that performs recognition of the cutout character image. A character recognition unit that outputs a code sequence, and position information of a place where an image is input are stored in a PHS (Personal Handyphone System) or GP.
A position information acquisition unit that acquires using the S (Global Positioning System), and a word dictionary that narrows down the position information in the word dictionary based on the acquired position information and selects a part of the word dictionary used for word matching A word matching unit that performs word matching between a selecting unit, a character code string obtained by a character recognizing unit, and a word dictionary obtained by a word dictionary selecting unit, and a result obtained by the word matching unit. And an output unit for outputting a result having the largest number of matching characters.

【０００７】本発明の作用は以下のとおりである。本発
明では，位置情報を利用して単語辞書の選択を行うこと
により，より少ない単語辞書と文字認識結果との照合を
行うことで，誤照合を減少させることが可能となる。ま
た，同じ名前の企業名や個人名が存在した場合や，市外
局番の書かれていない電話番号を認識しようとした場合
でも，位置情報の利用によって正解を得ることが可能と
なる。The operation of the present invention is as follows. According to the present invention, by selecting a word dictionary using position information, it is possible to reduce erroneous matching by comparing less word dictionaries with character recognition results. In addition, even when a company name or an individual name having the same name exists, or when an attempt is made to recognize a telephone number without an area code, a correct answer can be obtained by using the location information.

【０００８】[0008]

【発明の実施の形態】図１は，本発明の実施の形態に係
る文字認識装置の全体概要を示すブロック図である。本
実施の形態の文字認識装置は，単語辞書１，画像入力部
２，文字列切り出し部３，文字切り出し部４，文字認識
部５，位置情報取得部６，単語辞書選択部７，単語照合
部８，出力部９から構成されている。FIG. 1 is a block diagram showing an overall outline of a character recognition device according to an embodiment of the present invention. The character recognition device according to the present embodiment includes a word dictionary 1, an image input unit 2, a character string cutout unit 3, a character cutout unit 4, a character recognition unit 5, a position information acquisition unit 6, a word dictionary selection unit 7, a word collation unit. And an output unit 9.

【０００９】単語辞書１には，企業名と住所，電話番号
と住所，あるいは個人名と住所などが対になって格納さ
れている。単語辞書１としては例えばタウンページ等の
電話帳データベースを用いることが考えられる。The word dictionary 1 stores a company name and an address, a telephone number and an address, or a personal name and an address in pairs. As the word dictionary 1, for example, a telephone directory database such as a town page may be used.

【００１０】画像入力部２では，企業名や電話番号，個
人名を含む画像の入力を行う。これは，例えば看板や駅
看板，ポスター等のプレート型の文字情報を含む画像で
ある。文字列切り出し部３では，画像入力部２で入力さ
れた画像から，文字列画像を切り出す。文字サイズの変
動が小さく，縦あるいは横一列に並んだ文字のかたまり
を一文字列として切り出す。ここでは，例えば駅看板な
ら看板中に書かれた広告主名，電話番号，住所等の文字
列が，ポスターならポスターのタイトルや連絡先等の複
数行にまたがる文字列がすべて切り出されることにな
る。The image input unit 2 inputs an image including a company name, a telephone number, and a personal name. This is an image including plate-type character information such as a signboard, a station signboard, and a poster. The character string cutout unit 3 cuts out a character string image from the image input by the image input unit 2. A chunk of characters arranged in a vertical or horizontal line with small variations in character size is cut out as one character string. Here, for example, a character string such as an advertiser name, telephone number, address, etc. written on a signboard for a station sign, and a character string extending over multiple lines such as a poster title and contact information for a poster are all cut out. .

【００１１】文字切り出し部４では，文字列切り出し部
３で切り出された文字列画像のすべてについて，１文字
分ずつ文字画像を切り出し，文字サイズの算出を行う。
この文字切り出し部４により，背景とのコントラストな
どによって切り出された文字列画像の矩形領域から，さ
らに１文字単位の文字画像が切り出されることになる。
次に，文字認識部５では，全文字列について，切り出さ
れた各文字画像の認識を行い，認識結果文字コード列を
出力する。The character cutout unit 4 cuts out character images one by one for each of the character string images cut out by the character string cutout unit 3, and calculates the character size.
The character cutout unit 4 further cuts out a character image in units of one character from the rectangular area of the character string image cut out based on the contrast with the background or the like.
Next, the character recognizing unit 5 recognizes each of the cut-out character images for all the character strings, and outputs a recognition result character code string.

【００１２】文字列切り出し部３，文字切り出し部４お
よび文字認識部５で使用する方法としては，例えば，文
字線とその背景の濃度コントラストが高い，文字の外接
矩形が正方形に近いものが多いなどの文字の普遍的な性
質を利用することによって，雑音が多く条件の変化の激
しい情景画像から文字を抽出，認識する方法が提案され
ている（大谷淳，塩昭夫，“情景画像からの文字パター
ン抽出と認識”，電子情報通信学会論文誌Ｄ Vol.J71-D
No.6 pp.1037-1047）。As a method used in the character string extracting section 3, the character extracting section 4, and the character recognizing section 5, for example, the density contrast between the character line and its background is high, and the circumscribed rectangle of the character is often close to a square. A method of extracting and recognizing characters from scene images with noisy conditions and drastic changes in conditions by using the universal properties of characters has been proposed (Jun Otani, Akio Shio, "Character Patterns from Scene Images" Extraction and recognition ”, IEICE Transactions D Vol.J71-D
No.6 pp.1037-1047).

【００１３】なお，文字列切り出し部３を省略して，画
像入力部２により入力した情景画像からダイレクトに文
字を切り出してもよい。これには，例えばエッジの特徴
を利用した手法（桑野，倉掛，小高，“映像データ検索
のためのテロップ文字抽出法”，信学技報ＰＭＲＵ96,
93-103, pp.39-46）を適用することができる。The character string cutout unit 3 may be omitted, and characters may be cut out directly from the scene image input by the image input unit 2. This includes, for example, methods using edge features (Kuwano, Kurakake, Odaka, “Ticker character extraction method for video data search”, IEICE Technical Report PMRU96,
93-103, pp.39-46) can be applied.

【００１４】位置情報取得部６では，画像入力がなされ
た地点の位置情報を，ＰＨＳあるいはＧＰＳ等を用いて
取得する。位置情報には誤差が含まれるため，その誤差
の最大値も検出しておく。また，位置情報としては緯度
経度情報が取得される。The position information acquiring section 6 acquires the position information of the point where the image is input by using PHS or GPS. Since the position information includes an error, the maximum value of the error is also detected. In addition, latitude and longitude information is acquired as position information.

【００１５】次に単語辞書選択部７では，位置情報取得
部６により取得された位置情報およびその誤差の範囲を
利用して，単語辞書１の中で照合に使用する部分の選択
を行う。これにより，単語照合に使用するレコードの数
を，多数のレコードの中から数十件，あるいは数百件の
レコードに絞り込むことが可能となる。Next, the word dictionary selecting section 7 selects a portion to be used for collation in the word dictionary 1 using the position information acquired by the position information acquiring section 6 and the range of the error. As a result, the number of records used for word matching can be narrowed down to tens or hundreds of records from many records.

【００１６】単語辞書１の選択は，例えば単語辞書１内
の住所情報を緯度経度情報に変換したものと，位置情報
取得部６で検出された緯度経度情報を用いて，誤差を含
めた緯度経度の範囲に収まる範囲の辞書の選択を行うと
いった方法が考えられる。例えばインターネット版のタ
ウンページ・データベースには，緯度経度情報が付与さ
れているので（島健一，高橋克巳，三浦信幸，“インタ
ーネット版マルチメディア電話帳の構築”，オンライン
プロシーディングス Japan World Wide Web Conference
'95(1995)），それを単語辞書１として利用することに
より，緯度経度情報を用いた単語辞書情報の絞り込みを
行うこともできる。The word dictionary 1 is selected, for example, by converting address information in the word dictionary 1 into latitude / longitude information and using the latitude / longitude information detected by the position information acquisition unit 6 to obtain a latitude / longitude including an error. For example, there is a method of selecting a dictionary in a range falling within the range. For example, the Internet version of the town page database is provided with latitude and longitude information (Kenichi Shima, Katsumi Takahashi, Nobuyuki Miura, "Building an Internet Multimedia Phonebook", Online Proceedings Japan World Wide Web Conference
'95 (1995)), by using it as the word dictionary 1, it is possible to narrow down the word dictionary information using the latitude and longitude information.

【００１７】単語照合部８では，文字認識部５で得られ
た文字コード列と，単語辞書選択部７で得られた単語辞
書とを用いて単語照合を行い，一致文字数をカウントす
る。例えば，認識対象が一文字列の看板の場合には，単
語辞書の企業名との単語照合を行う。また，複数の文字
列が存在する認識対象の場合には，複数の文字列と複数
のレコードとの間で単語照合を行うため，すべての組み
合わせにおいて総当たりで単語照合を行い，一致文字数
が最大となる組み合わせを求める。あるいは，書式解析
等を利用して単語照合を行う文字列の認識結果を選択
し，その一致文字数を算出してもよい。The word matching section 8 performs word matching using the character code string obtained by the character recognition section 5 and the word dictionary obtained by the word dictionary selection section 7 and counts the number of matching characters. For example, when the recognition target is a signboard of one character string, word matching with a company name in a word dictionary is performed. In addition, in the case of a recognition target having a plurality of character strings, word matching is performed between the plurality of character strings and a plurality of records. Find the combination Alternatively, a recognition result of a character string to be subjected to word matching using format analysis or the like may be selected, and the number of matching characters may be calculated.

【００１８】ここで，単語照合を行う文字列の選択方法
としては，「荒木，宮本，鈴木，中村，杉村，“複数情
報の統合による看板文字認識”，１９９８信学会ソ大会
Ｄ−１２−２５」において提案されている看板の特徴を
利用した方法を用いることができる。また，単語照合方
法としては，例えば「北村正，仲林清，大光明直孝，中
村修，“単語知識を利用した手書き文字列処理方式”，
ＮＴＴＲ＆Ｄ，３９，３，pp429-436 （１９９０）」に
おいて提案されている文字切り出し誤りによるずれを許
容する手法を用いることができる。Here, as a method of selecting a character string to be subjected to word collation, "Araki, Miyamoto, Suzuki, Nakamura, Sugimura," Signboard Character Recognition by Integrating Multiple Information ", 1998 IEICE Society Conference D-12-25. ], A method utilizing the features of a signboard can be used. As word matching methods, for example, "Tadashi Kitamura, Kiyoshi Nakabayashi, Akitaka Omitsu, Osamu Nakamura," Handwritten character string processing method using word knowledge ",
NTTR & D, 39, 3, pp 429-436 (1990) ”can be used to allow a shift due to a character extraction error.

【００１９】出力部９では，単語照合部８でカウントさ
れた一致文字数が最も多いレコ一ドを，単語照合結果と
して出力する。例えば企業名を出力し，インターネット
等の検索エンジンに入力することで，その企業に関する
情報を取得することが可能となる。また，電話番号や住
所情報をアンド条件で付与することにより，さらに取得
したい情報に絞り込むことが可能となる。The output unit 9 outputs a record having the largest number of matching characters counted by the word matching unit 8 as a word matching result. For example, by outputting a company name and inputting it to a search engine such as the Internet, information about the company can be obtained. Also, by giving the telephone number and the address information under the AND condition, it is possible to further narrow down the information to be acquired.

【００２０】このように構成した文字認識の動作および
作用を説明する。図２は，図１に示した文字認識装置の
動作を示すフローチャートである。The operation and operation of the thus configured character recognition will be described. FIG. 2 is a flowchart showing the operation of the character recognition device shown in FIG.

【００２１】まず，ステップ２０において企業名や電話
番号，住所，個人名等の書かれた文字画像を読み込み，
ステップ２１で文字列の切り出しを行う。ステップ２２
では切り出された文字列画像から，一文字ずつの文字画
像を切り出し，ステップ２３で文字認識を行う。First, in step 20, a character image in which a company name, a telephone number, an address, a personal name and the like are written is read.
In step 21, a character string is cut out. Step 22
Then, a character image of each character is extracted from the extracted character string image, and character recognition is performed in step 23.

【００２２】入力された画像から文字列を切り出す際の
イメージを図３に示す。画像入力部２は，例えば図３に
示すカメラ画像１１のように，看板なら看板１枚，ポス
ターならポスター１枚が入るように撮影されてキャプチ
ャされた画像を入力する。文字列切り出し部３は，この
ようなカメラ画像１１から，背景とのコントラストなど
によって文字の塊である矩形領域を文字列として切り出
す。これによって，図３のカメラ画像１１から２つの文
字列１２−１，１２−２が切り出されることになる。FIG. 3 shows an image of extracting a character string from an input image. The image input unit 2 inputs an image captured and captured such that one signboard is inserted for a signboard and one poster is inserted for a poster, as in a camera image 11 shown in FIG. 3, for example. The character string cutout unit 3 cuts out a rectangular area, which is a lump of characters, as a character string from such a camera image 11 by contrast with the background or the like. As a result, two character strings 12-1 and 12-2 are cut out from the camera image 11 in FIG.

【００２３】文字切り出し部４は，２つの文字列１２−
１，１２−２の各々について１文字ずつの文字単位に分
離し，この分離した文字画像１３を文字認識部５に渡
す。文字認識部５では，パターンマッチングなどにより
文字認識を行う。この結果，各文字列の認識結果１４−
１，１４−２が得られる。The character extracting section 4 includes two character strings 12-
Each of the characters 1 and 12-2 is separated into one character unit, and the separated character image 13 is passed to the character recognition unit 5. The character recognition unit 5 performs character recognition by pattern matching or the like. As a result, the recognition result 14-
1, 14-2 are obtained.

【００２４】上記処理において，文字列切り出し部３の
処理を省略し，図３（Ｂ）に示すように，カメラ画像１
１からダイレクトに文字画像１５を抽出し，それを文字
認識部５で認識するようにしてもよい。In the above processing, the processing of the character string cutout unit 3 is omitted, and as shown in FIG.
Alternatively, the character image 15 may be directly extracted from 1 and may be recognized by the character recognition unit 5.

【００２５】図２のステップ２４では，文字画像を読み
込んだ地点の位置情報を取得し，ステップ２５では，前
のステップ２４で取得された位置情報をもとに単語辞書
１の選択を行う。ここで，単語辞書１の選択とは，単語
辞書１の中で使用するレコードを選択することを意味す
る。なお，単語辞書１が複数ある場合には，位置情報に
基づいて適当な単語辞書１を選択することも含まれる。In step 24 of FIG. 2, the position information of the point where the character image is read is obtained. In step 25, the word dictionary 1 is selected based on the position information obtained in the previous step 24. Here, selecting the word dictionary 1 means selecting a record to be used in the word dictionary 1. When there are a plurality of word dictionaries 1, selecting an appropriate word dictionary 1 based on the position information is also included.

【００２６】例えば，図４（Ａ）に示すような企業名，
電話番号，住所，緯度経度情報を持つ単語辞書１があっ
た場合，単語辞書選択部７は，位置情報取得部６により
ＰＨＳやＧＰＳを用いて画像の撮影時に取得した緯度経
度情報から，位置計測の誤差に含まれる範囲内の緯度経
度情報を持つレコードに絞り込みを行い，絞り込まれた
辞書３１を得る。For example, a company name as shown in FIG.
When there is the word dictionary 1 having the telephone number, the address, and the latitude / longitude information, the word dictionary selection unit 7 performs the position measurement from the latitude / longitude information acquired at the time of capturing the image using the PHS or GPS by the position information acquisition unit 6. Are narrowed down to records having latitude / longitude information within a range included in the error, and a narrowed dictionary 31 is obtained.

【００２７】単語辞書１のレコードが緯度経度情報を持
たない場合には，図４（Ｂ）に示すように住所と緯度経
度情報の対応情報を持つ住所情報データベース（ＤＢ）
３２を用い，これによって位置情報取得部６により取得
した緯度経度情報を住所情報に変換し，変換後の住所情
報を用いて単語辞書１の絞り込みを行う。If the record of the word dictionary 1 does not have the latitude and longitude information, as shown in FIG. 4B, an address information database (DB) having correspondence information between the address and the latitude and longitude information
32, the latitude / longitude information acquired by the position information acquisition unit 6 is converted into address information, and the word dictionary 1 is narrowed down using the converted address information.

【００２８】これによって，例えばタウンページ・デー
タベース全体には，横須賀市を例に挙げると，２万５千
件のレコードが収められているが，位置情報によって１
００件とか数十件に単語照合する辞書サイズを小さくす
ることができ，高速化と高精度化を図ることができる。Thus, for example, the entire town page database contains 25,000 records in the case of Yokosuka City, for example.
It is possible to reduce the size of a dictionary for performing word matching on 00 or several tens of cases, thereby achieving high speed and high accuracy.

【００２９】ステップ２６では，選択された単語辞書１
（絞り込まれた辞書３１）と，ステップ２３において得
られた文字コード列を用いた単語照合を行い，レコード
ごとの単語照合結果となる一致文字数を算出しておく。
最後に，ステップ２７で単語照合結果の一致文字数が多
いものを最尤候補として出力して処理を終了する。In step 26, the selected word dictionary 1
Word matching is performed using (the narrowed-down dictionary 31) and the character code string obtained in step 23, and the number of matching characters as the word matching result for each record is calculated.
Finally, in step 27, a word matching result having a large number of matching characters is output as the maximum likelihood candidate, and the process ends.

【００３０】図５は，この単語照合の例を説明するため
の図である。例えば，Ｍ₁：“荒木歯科” Ｍ₂：“ＴＥＬ：１２−３４５６” Ｍ₃：“営業時間：月〜土ＡＭ１０時〜ＰＭ１５時” の３行の文字列Ｍ１〜Ｍ３が書かれた看板の画像から切
り出された文字列画像を，文字認識部５により認識した
結果，認識結果文字列として次のＣ１〜Ｃ３が得られた
とする。FIG. 5 is a diagram for explaining an example of this word collation. For example, M _1: "Araki _{dental" M 2: "TEL: 12-3456} " M 3: " Opening hours: Monday to Saturday at AM10 o'clock ~PM15" three lines of string M1~M3 is a sign that was written It is assumed that as a result of recognizing the character string image cut out from the image by the character recognizing unit 5, the following C1 to C3 are obtained as recognition result character strings.

【００３１】Ｃ₁：“荒本歯科” Ｃ₂：“ＴＦＩ：１７−８４５６” Ｃ₃：“官業時門：日−土ＡＭ０時〜ＰＭ１５時” 一方，この看板の画像を撮影した場所の位置情報をもと
に絞り込まれた辞書３１が，図５（Ｂ）に示すように，
レコードＲ₁〜Ｒ₄であったとすると，単語照合部８
は，認識結果文字列Ｃ₁〜Ｃ₃と，辞書３１中の各レコ
ードに含まれる企業名，電話番号，住所とのすべての組
み合わせを照合する。C ₁ : “Aramoto dentistry” C ₂ : “TFI: 17-8456” C ₃ : “Government time: Sunday-Saturday AM0 to PM15: 00” On the other hand, the position of the place where the image of this signboard was taken The dictionary 31 narrowed down based on the information, as shown in FIG.
Assuming that the records are R _{1 to} R ₄ , the word matching unit 8
Collates the recognition result string C ₁ -C _3, company names in each record in the dictionary 31, a telephone number, all the combinations of the addresses.

【００３２】すなわち，単語辞書選択部７において選択
したレコードＲ₁〜Ｒ₄のそれぞれについて，以下のよ
うに一致文字数を算出する。企業名とＣ₁，電話番号とＣ₂，住所とＣ₃を照合
し，一致文字数の合計を算出する。企業名とＣ₁，電話番号とＣ₃，住所とＣ₂を照合
し，一致文字数の合計を算出する。企業名とＣ₂，電話番号とＣ₁，住所とＣ₃を照合
し，一致文字数の合計を算出する。 ……。That is, the number of matching characters is calculated for each of the records R _{1 to} R ₄ selected by the word dictionary selecting section 7 as follows. Company name and the C _1, phone number and C _2, matches the address and C _3, to calculate the sum of the matched characters. Company name and the C _1, phone number and C _3, matches the address and C _2, to calculate the sum of the matched characters. The company name is compared with C ₂ , the telephone number with C ₁ , and the address with C _3, and the total number of matching characters is calculated. ......

【００３３】このようにすべての組み合わせで単語照合
を行い，その一致文字数の合計が最大となった組み合わ
せの単語照合結果を，そのレコードの単語照合結果とす
る。これを式で表すと，レコードＲ₁の単語照合結果
は，図５（Ｃ）に示すようになる。As described above, word matching is performed for all combinations, and the word matching result of the combination having the largest total number of matching characters is defined as the word matching result of the record. Expressing this by the formula, word collating result record R ₁ is as shown in FIG. 5 (C).

【００３４】Ｍａｘ（Ｎ₁₁とＣ₁の一致文字数＋Ｎ₁₂と
Ｃ₂の一致文字数＋Ｎ₁₃とＣ₃の一致文字数，Ｎ₁₁とＣ
₁の一致文字数＋Ｎ₁₂とＣ₃の一致文字数＋Ｎ₁₃とＣ₂
の一致文字数，Ｎ₁₁とＣ₂の一致文字数＋Ｎ₁₂とＣ₁の
一致文字数＋Ｎ₁₃とＣ₃の一致文字数，…………）＝ｎ
₁ レコードＲ₂の単語照合結果ｎ₂，レコードＲ₃の単語
照合結果ｎ₃，レコードＲ₄の単語照合結果ｎ₄も同様
に算出する。[0034] Max (N ₁₁ and matched characters + N ₁₂ and matched characters + number of matched characters of N ₁₃ and C ₃ of the C ₂ of C _1, N ₁₁ and C
Number of matched characters + N ₁₃ of the _first matched characters + N ₁₂ and C ₃ and C ₂
Number of matched characters, match the number of characters N ₁₁ and matched characters + N ₁₃ and C ₃ of matched characters + N ₁₂ and C ₁ to C _2, ............) = n of
₁ record R ₂ words matching result n _2, word collating result n ₃ records R _3, it is also calculated similarly word collating result n ₄ records R _4.

【００３５】最終的な看板の単語照合結果は，図５
（Ｄ）に示すように，ｎ₁，ｎ₂，ｎ₃，ｎ₄の中で最
大であるもののレコードである。これを単語照合結果と
して，出力部９により出力する。企業名だけを出力して
もよく，また電話番号と住所も合わせて出力するように
してもよい。The final word matching result of the signboard is shown in FIG.
As shown in (D), this is the record of the largest among n ₁ , n ₂ , n ₃ and n ₄ . This is output by the output unit 9 as a word matching result. Only the company name may be output, or the telephone number and the address may be output together.

【００３６】以上の実施の形態の説明から明らかなよう
に，本発明は，一般に高精度な文字認識結果を得るのが
難しい看板等の文字列を精度高く特定するために，位置
情報を使用し，高速高精度な単語照合を実現するもので
ある。これにより，チェーン店など，位置情報がなけれ
ば一つに特定できないものが，特定できるようになる。
したがって，従来の膨大な辞書による単語照合よりもさ
らに高精度な単語照合が可能となる。また，単に文字認
識するだけではなく，ユーザが見た対象を特定すること
になるので，その結果を用いた効率のよい情報検索の支
援が可能になる。As is clear from the above description of the embodiment, the present invention uses position information in order to specify a character string of a signboard or the like in which it is generally difficult to obtain a highly accurate character recognition result. , Realizing high-speed and high-precision word matching. As a result, items that cannot be uniquely identified without location information, such as chain stores, can be identified.
Therefore, word matching can be performed with higher accuracy than conventional word matching using a huge dictionary. In addition, not only character recognition, but also the target that the user has seen is specified, so that efficient information retrieval support using the result can be performed.

【００３７】[0037]

【発明の効果】以上説明したように，本発明によれば，
文字認識と位置情報とを統合することによって，単に文
字認識のみを用いた場合と比較して出力，表示される情
報を有効に絞り込むことが可能になる。したがって，表
示情報を手掛りとした利用者による情報検索を効率よく
支援することができる。As described above, according to the present invention,
By integrating the character recognition and the position information, it is possible to effectively narrow down the information to be output and displayed as compared with the case where only character recognition is used. Therefore, it is possible to efficiently support the information search by the user using the display information as a clue.

[Brief description of the drawings]

【図１】本発明の実施の形態の全体概要を示すブロック
図である。FIG. 1 is a block diagram showing an overall outline of an embodiment of the present invention.

【図２】本発明の実施の形態に係る動作を示すフローチ
ャートである。FIG. 2 is a flowchart showing an operation according to the embodiment of the present invention.

【図３】入力された画像から文字列を切り出す際の処理
を説明する図である。FIG. 3 is a diagram illustrating a process for extracting a character string from an input image.

【図４】単語辞書の選択を説明する図である。FIG. 4 is a diagram illustrating selection of a word dictionary.

【図５】単語照合を説明する図である。FIG. 5 is a diagram illustrating word matching.

[Explanation of symbols]

１単語辞書２画像入力部３文字列切り出し部４文字切り出し部５文字認識部６位置情報取得部７単語辞書選択部８単語照合部９出力部 DESCRIPTION OF SYMBOLS 1 Word dictionary 2 Image input part 3 Character string extraction part 4 Character extraction part 5 Character recognition part 6 Position information acquisition part 7 Word dictionary selection part 8 Word collation part 9 Output part

───────────────────────────────────────────────────── フロントページの続き (72)発明者桑野秀豪東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内Ｆターム(参考） 5B064 BA01 EA19 5J062 AA01 BB00 CC07 GG02 5K024 AA79 CC11 GG01 GG10 5K036 KK09 9A001 BB03 BB04 FF03 HH20 HH22 HH28 JJ19 JJ25 JJ78 KK37 KK42 LL02 ────────────────────────────────────────────────── ─── Continuing on the front page (72) Inventor Hidego Kuwano 3-19-2 Nishi-Shinjuku, Shinjuku-ku, Tokyo F-term within Nippon Telegraph and Telephone Corporation (reference) 5B064 BA01 EA19 5J062 AA01 BB00 CC07 GG02 5K024 AA79 CC11 GG01 GG10 5K036 KK09 9A001 BB03 BB04 FF03 HH20 HH22 HH28 JJ19 JJ25 JJ78 KK37 KK42 LL02

Claims

[Claims]

An image input unit for inputting an image including character information such as a company name, a telephone number or a personal name; and a character image cut out from the image input by the image input unit.
A character recognition device including a recognition unit that recognizes the cut-out character image and outputs a recognition result character code string; a position information acquisition unit that acquires position information when an image is input by the image input unit; A word dictionary selection unit for narrowing down word dictionary information used for word matching based on the position information obtained by the position information obtaining unit; and a recognition unit using the word dictionary information narrowed down by the word dictionary selection unit. A word matching unit that performs word matching with the character code string output by the above, and an output unit that outputs, as a character recognition result, a word having the largest number of matching characters among the word matching results obtained by the word matching unit. A character recognition device characterized in that:

2. A step of inputting an image including character information such as a company name, a telephone number or a personal name, cutting out a character image from the input image, recognizing the cut-out character image, and recognizing a character as a recognition result. Outputting a code sequence;
A step of acquiring position information when the image is input; a step of narrowing down information in a word dictionary used for word matching based on the acquired position information; A character characterized by comprising a step of performing word matching between a word in a dictionary and the above-mentioned recognition result character code string, and a step of outputting, as a character recognition result, a word having the largest number of matching characters in the above word matching result. Recognition method.

3. A process of inputting an image including character information such as a company name, a telephone number or a personal name, cutting out a character image from the input image, recognizing the cut-out character image, and recognizing a character as a recognition result. A process of outputting a code sequence, a process of acquiring position information when the image is input, a process of narrowing down information in a word dictionary used for word matching based on the acquired position information, A process of performing word matching between a word in a word dictionary and the above-described recognition result character code string using dictionary information, a process of outputting a word having the largest number of matching characters in the above-described word matching result as a character recognition result, Recording a program for executing a character recognition method, characterized by recording a program for causing a computer to execute the method.