JP5240093B2

JP5240093B2 - ID card shooting system, ID card shooting method and program

Info

Publication number: JP5240093B2
Application number: JP2009150669A
Authority: JP
Inventors: 浩久稲本; 幸二小林; 酉華木原
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2009-06-25
Filing date: 2009-06-25
Publication date: 2013-07-17
Anticipated expiration: 2029-06-25
Also published as: JP2011009986A

Description

本発明は、画像認識技術に関する。 The present invention relates to an image recognition technique.

インターネットの発達に伴い、近年ではインターネットショッピングが盛んである。インターネットショッピングが盛んな理由には大きく３つが存在する。一つは世界中から珍しい商品を簡単に手に入れられるといったメリットである。次に、店舗に足を運ばなくて良い、商品は家まで届けてくれる、時間を選ばず買い物ができるといった手軽さに起因するメリットが有る。更には、様々な店舗の中から最も安い店で買い物ができるといったコストに起因するメリットが有る。 With the development of the Internet, Internet shopping has become popular in recent years. There are three main reasons why Internet shopping is so popular. One is the advantage that you can easily get rare products from all over the world. Next, there are merits due to the convenience of not having to visit the store, delivering the product to the house, and shopping at any time. Furthermore, there is a merit due to the cost of being able to shop at the cheapest store among various stores.

さて、前述の通り、インターネットで販売される商品は非常に多岐に渡る。その中には購買に年齢制限が存在する酒や煙草と言った嗜好品も含まれる。元来の対面での販売では、商品購入の際に店員による確認が可能であった。一方、インターネットショッピングにおいては、これが不可能になってしまっている。そのため、これらの商品を扱うインターネットショップでは、身分証明書の提示をユーザに求め、年齢確認を行ってから販売を行っている。尚、年齢確認をせずに販売しているショップもあるが、今後規制が進み前者のような販売スタイルは廃れることが予想される。 As described above, there are a wide variety of products sold on the Internet. These include luxury goods such as liquor and cigarettes that have age restrictions on purchases. In the original face-to-face sales, it was possible to confirm by the store clerk when purchasing the product. On the other hand, this has become impossible in Internet shopping. For this reason, in an Internet shop that handles these products, the user is requested to present an identification card, and the sale is performed after the age is confirmed. There are some shops that sell without checking their age, but regulations are expected to advance and the sales style like the former is expected to be eliminated.

ここで、身分証明書の提示をするためにユーザは以下のようなステップを踏んでいるのが現状である。１．コンビニなど、コピーの取れる所定の場所に行く。２．お金を払ってコピーをとる。３．申込用紙に必要事項を記入。４．封筒に入れて切手を買う。５．封筒を投函。６.年齢確認完了後発注。 Here, in order to present an identification card, the user is currently taking the following steps. 1. Go to a place where you can make a copy, such as a convenience store. 2. Pay money to make a copy. 3. Fill out the application form. 4). Buy stamps in envelopes. 5. Post envelopes. 6. Place an order after confirming the age.

本来、インターネットショッピングを行う主な理由には手軽さと、コストが有ることは前に述べたとおりである。しかし、上のような年齢確認のステップは手軽さとコストのメリットを大幅に損なうという問題がある。更にいうと、以上述べたような、年齢などの身分を確認するステップは、様々な場面でユーザの手軽さやコストといったメリットを損なうという問題がある。例えばネットバンクの口座開設などもこのような問題を持っている。 Basically, the main reason for Internet shopping is its simplicity and cost, as mentioned earlier. However, there is a problem that the age confirmation step as described above greatly reduces the merit of convenience and cost. Furthermore, as described above, the step of confirming the status such as age has a problem that the merit such as the convenience and cost of the user is impaired in various scenes. For example, opening a bank account has such a problem.

このような問題に対して特許文献１では専用の端末を用意し、自動車免許証を読み取って生年月日を認識して年齢認証を行い、該年齢認証結果を転送することにより、年齢制限のある商品をユーザに購買させるという発明がなされている。本発明を利用すれば上記の２.コピー取得。４．封筒に入れ切手を買う。５．封筒を投函と言った作業が軽減されるため、ユーザは比較的簡便に年齢確認を完了させられる。しかし、専用の端末が設置している場所まで赴く必要があり、依然、１.所定の場所に行く、という問題点は解消されていない。 With respect to such a problem, Patent Document 1 prepares a dedicated terminal, reads an automobile license, recognizes the date of birth, performs age verification, and transfers the age verification result to limit age. An invention has been made to allow a user to purchase a product. If the present invention is used, 2. Copy acquisition as described above. 4). Buy stamps in envelopes. 5. Since the operation of posting the envelope is reduced, the user can complete the age check relatively easily. However, it is necessary to go to a place where a dedicated terminal is installed, and the problem of 1. going to a predetermined place has not been solved.

一方、この発明に対して、専用の端末を用意するのではなく、ユーザの持つ携帯電話に付随しているデジタルカメラを用いて運転免許証を撮影し、年齢認証を行うという技術が容易に類推できる。この技術によれば自宅など、どこにいても自由に年齢認証が可能となり、ユーザの利便性は大幅に向上する。 On the other hand, instead of preparing a dedicated terminal for this invention, it is easy to analogize the technology of photographing a driver's license using a digital camera attached to a user's mobile phone and performing age authentication. it can. According to this technology, age authentication can be freely performed anywhere, such as at home, and the convenience of the user is greatly improved.

一方、専用の端末などと違い、携帯電話に付随するカメラは撮影条件が固定できないため、認識に好適な画像が得られるとは限らない。例えば運転免許証を斜めから撮影すると、図２３に示すように、あおり歪みと呼ばれる歪みが生じる。このような歪みが生じると運転免許証に記載された文字も歪むため、運転免許証に記載された内容の認識が難しくなる。 On the other hand, unlike a dedicated terminal or the like, an image suitable for recognition is not always obtained because a camera attached to a mobile phone cannot fix shooting conditions. For example, when a driver's license is photographed from an oblique direction, distortion called tilt distortion occurs as shown in FIG. When such distortion occurs, characters written on the driver's license are also distorted, making it difficult to recognize the contents written on the driver's license.

このあおり歪みを、画像処理を用いることで補正するという発明はいくつかなされている。例えば、特許文献２では画像中二つの線分を指定することで、あおり補正を行うという発明がなされている。本発明は、補正前の台形が補正後にどのような長方形に変換されるかというような線形変換マトリクスを算出して、該マトリクスを全面に施すことによりあおりの補正を行うという手段に起因するものである。図２４で示すように補正前と補正後とで、黒丸で示された４点ずつ計８点を指定する必要がある。 Several inventions have been made to correct this tilt distortion by using image processing. For example, Patent Document 2 discloses an invention in which tilt correction is performed by designating two line segments in an image. The present invention is based on means for calculating a linear transformation matrix such as what kind of rectangle the post-correction trapezoid is converted to after correction, and correcting the tilt by applying the matrix to the entire surface. It is. As shown in FIG. 24, it is necessary to designate a total of 8 points by 4 points indicated by black circles before and after correction.

前記発明では補正前の４点をユーザが指定する。更に補正後の４点を計算により算出するが、補正後の４点に対しては簡易的に算出しているため、正確に補正できない。実は補正後の画像における、長方形の辺の長さを求めるためには、カメラに対する運転免許証の傾き角度と、カメラから運転免許証までの距離が必要である。 In the above invention, the user designates four points before correction. Furthermore, four points after correction are calculated by calculation. However, since the four points after correction are simply calculated, they cannot be corrected accurately. Actually, in order to obtain the length of the side of the rectangle in the corrected image, the inclination angle of the driving license with respect to the camera and the distance from the camera to the driving license are required.

撮影角度によって長方形の辺の長さが変化してしまう仕組みを図２５で説明する。
カメラは撮影した３次元空間を平面で表現するため、３次元空間の各点はカメラと点を結んだ線が、像ができる面と交わる点に記述される。その結果本来長さL1+L2であった長方形の辺はl1+l2と表現される。l1とL1との関係は以下の数式で記述できる。 A mechanism in which the length of a rectangular side changes depending on the shooting angle will be described with reference to FIG.
Since the camera expresses the photographed three-dimensional space with a plane, each point in the three-dimensional space is described at a point where a line connecting the camera and the point intersects the plane where the image can be formed. As a result, the rectangular side originally having the length L1 + L2 is expressed as l1 + l2. The relationship between l1 and L1 can be described by the following mathematical formula.

ここでdはカメラと像ができる面との距離でθは像ができる面に対して、長方形がどれだけ傾いているかの角度である。以上のようにθとdが明らかにならないと元の長方形の大きさは判らない。つまり、角度θと距離dを何らかの手段で取得しない限り、正確なあおり補正を行うことは不可能である。何らかの手段とは例えば測距センサなどである。 Here, d is the distance between the camera and the image-forming surface, and θ is the angle of how much the rectangle is inclined with respect to the image-forming surface. As described above, the size of the original rectangle cannot be determined unless θ and d are clarified. That is, accurate tilt correction is impossible unless the angle θ and the distance d are acquired by some means. Some means is, for example, a distance measuring sensor.

これに対して、特許文献３では撮像装置に距離を計測する手段と角度をユーザに入力させる手段を具備し、それにより正確なあおり補正を行う発明がなされているが、携帯電話に付随するカメラにはそのような手段は具備されていないという問題があった。 On the other hand, Patent Document 3 discloses an invention in which an imaging device is provided with a means for measuring a distance and a means for allowing a user to input an angle, whereby accurate tilt correction is performed. However, there is a problem that such a means is not provided.

また特許文献４では、簡易的に補正後の長方形を算出し、その後、長方形のアスペクト比をユーザに指定させることによって正確なあおり補正を行っているが、アスペクト比の指定という作業はユーザにとっては難解かつ煩雑であるという問題があった。 In Patent Document 4, the corrected rectangle is simply calculated, and then correct tilt correction is performed by allowing the user to specify the aspect ratio of the rectangle. However, the task of specifying the aspect ratio is difficult for the user. There was a problem that it was difficult and complicated.

本発明は以上の問題を解決するためのものであり、身分証明書を撮影した画像の、あおり歪みを正確に、しかも自動的に補正できるようにすることで、身分証明書に記載された内容を好適に認識させることを目的としている。 The present invention is for solving the above-mentioned problems, and the contents described in the identification card can be corrected accurately and automatically by correcting the distortion of the image obtained by photographing the identification card. This is intended to make it suitable for recognition.

請求項１に記載の発明は、身分証明書を撮影する撮像装置と、画像に顔が含まれるか否かを判定する顔検出手段と、顔が含まれる写真領域を検出する写真検出手段と、画像全体のあおりを補正するあおり補正手段とを備え、撮像装置により得られた画像において、顔検出手段により顔が検出された場合に、写真検出手段による写真領域の検出を行い、写真検出手段により得られた写真の大きさを用いて、あおり補正後の画像のアスペクト比を補正することを特徴とする身分証明書撮影システムである。 The invention described in claim 1 is an imaging device that photographs an identification card, a face detection unit that determines whether or not a face is included in an image, a photo detection unit that detects a photographic region including a face, A tilt correction unit that corrects a tilt of the entire image, and when a face is detected by the face detection unit in the image obtained by the imaging device, a photo area is detected by the photo detection unit, and the photo detection unit The identification photograph photographing system is characterized in that the aspect ratio of the image after tilt correction is corrected using the size of the obtained photograph.

請求項２に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、あおり補正後の画像に対して、顔検出処理を施すことを特徴とする。 According to a second aspect of the present invention, in the identification card photographing system according to the first aspect, a face detection process is performed on the image after the tilt correction.

請求項３に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、あおり補正前の画像に対して、顔検出処理を施すことを特徴とする。 According to a third aspect of the present invention, in the identification card photographing system according to the first aspect, a face detection process is performed on the image before the correction of the tilt.

請求項４に記載の発明は、請求項２もしくは３記載の身分証明書撮影システムにおいて、顔検出手段は、撮像装置に撮影される画像にて、概ね所定の方向を向いているときにのみ顔の検出が行えるようになっていることを特徴とする。 According to a fourth aspect of the present invention, in the identification card photographing system according to the second or third aspect, the face detecting means is a face only when the image captured by the imaging device is generally facing a predetermined direction. It is possible to detect.

請求項５に記載の発明は、請求項４記載の身分証明書撮影システムにおいて、さらに表示手段を備え、検出した顔写真領域の候補を表示し、候補が複数存在する場合には、表示手段に複数の結果を表示し、ユーザに選択させることを特徴とする。 According to a fifth aspect of the present invention, in the identification card photographing system according to the fourth aspect of the present invention, the information photographing system further includes a display unit, and displays the detected face photo area candidates. A plurality of results are displayed and the user is allowed to select.

請求項６に記載の発明は、請求項５記載の身分証明書撮影システムにおいて、写真領域の大きさと位置に応じて、身分証明書の記載事項読み取りを開始するか否かを決定することを特徴とする。 According to a sixth aspect of the present invention, in the identification card photographing system according to the fifth aspect, it is determined whether or not to start reading the description items of the identification card according to the size and position of the photo area. And

請求項７に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、撮像装置は、撮影時に自動的にマクロモードになることを特徴とする。 According to a seventh aspect of the present invention, in the identification card photographing system according to the first aspect, the imaging device automatically enters a macro mode during photographing.

請求項８に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、身分証明書に記載された事項を読み取るＯＣＲ手段を有しており、該ＯＣＲ手段は、撮像装置とは別のハードウェアに実装されていることを特徴とする。 The invention according to claim 8 is the identification card photographing system according to claim 1, further comprising OCR means for reading matters described in the identification card, wherein the OCR means is different from the imaging device. It is implemented in hardware.

請求項９に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、あおり補正手段は、写真検出手段により得られた写真領域に応じて、あおり補正を行うことを特徴とする。 According to a ninth aspect of the present invention, in the identification card photographing system according to the first aspect, the tilt correcting means performs the tilt correction in accordance with the photo area obtained by the photo detecting means.

請求項１０に記載の発明は、請求項１記載の身分証明書撮影システムにおいて、撮影時に表示手段へ、身分証名書の概形を表示し、該概形に合わせて撮影することをユーザに促すことを特徴とする。 According to the tenth aspect of the present invention, in the identification card photographing system according to the first aspect, the outline of the identification card is displayed on the display means at the time of photographing, and the user is allowed to photograph according to the rough shape. It is characterized by prompting.

請求項１１に記載の発明は、身分証明書を撮影する撮像装置と、画像に顔が含まれるか否かを判定する顔検出手段と、顔が含まれる写真領域を検出する写真検出手段と、画像全体のあおりを補正するあおり補正手段とを備える身分証明書撮影システムにおける身分証明書撮影方法であって、撮像装置により得られた画像において、顔検出手段により顔が検出された場合に、写真検出手段による写真領域の検出を行い、写真検出手段により得られた写真の大きさを用いて、あおり補正後の画像のアスペクト比を補正することを特徴とする身分証明書撮影方法である。 The invention described in claim 11 is an imaging device that photographs an identification card, a face detection unit that determines whether or not a face is included in an image, a photo detection unit that detects a photographic region including a face, An identification card photographing method in an identification card photographing system comprising a tilt correcting means for correcting a tilt of an entire image, wherein a face is detected by a face detecting means in an image obtained by an imaging device. The identification image photographing method is characterized in that a photo area is detected by a detecting means, and the aspect ratio of the image after tilt correction is corrected using the size of the photograph obtained by the photo detecting means.

請求項１２に記載の発明は、請求項１１記載の身分証明書撮影方法において、あおり補正後の画像に対して、顔検出処理を施すことを特徴とする。 According to a twelfth aspect of the present invention, in the identification card photographing method according to the eleventh aspect, a face detection process is performed on the image after tilt correction.

請求項１３に記載の発明は、請求項１１記載の身分証明書撮影方法において、あおり補正前の画像に対して、顔検出処理を施すことを特徴とする。 A thirteenth aspect of the invention is characterized in that, in the identification card photographing method of the eleventh aspect, face detection processing is performed on an image before tilt correction.

請求項１４に記載の発明は、請求項１２もしくは１３記載の身分証明書撮影方法において、顔検出手段は、撮像装置に撮影される画像にて、概ね所定の方向を向いているときにのみ顔の検出が行えるようになっていることを特徴とする。 According to a fourteenth aspect of the present invention, in the identification card photographing method according to the twelfth or thirteenth aspect, the face detection means is a face only when the image is photographed by the imaging device and is generally facing a predetermined direction. It is possible to detect.

請求項１５に記載の発明は、請求項１４記載の身分証明書撮影方法において、身分証明書撮影システムはさらに表示手段を備え、検出した顔写真領域の候補を表示し、候補が複数存在する場合には、表示手段に複数の結果を表示し、ユーザに選択させることを特徴とする。 According to a fifteenth aspect of the present invention, in the identification card photographing method according to the fourteenth aspect, the identification card photographing system further includes a display unit, and displays the detected face photo area candidates, and there are a plurality of candidates. Is characterized in that a plurality of results are displayed on the display means and the user is allowed to select.

請求項１６に記載の発明は、請求項１５記載の身分証明書撮影方法において、写真領域の大きさと位置に応じて、身分証明書の記載事項読み取りを開始するか否かを決定することを特徴とする。 According to a sixteenth aspect of the present invention, in the identification card photographing method according to the fifteenth aspect, it is determined whether or not to start reading the description items of the identification card according to the size and position of the photo area. And

請求項１７に記載の発明は、請求項１１記載の身分証明書撮影方法において、撮像装置は、撮影時に自動的にマクロモードになることを特徴とする。 According to a seventeenth aspect of the present invention, in the identification card photographing method according to the eleventh aspect, the imaging device automatically enters a macro mode during photographing.

請求項１８に記載の発明は、請求項１１記載の身分証明書撮影方法において、身分証明書に記載された事項を読み取るＯＣＲ手段を有しており、該ＯＣＲ手段は、撮像装置とは別のハードウェアに実装されていることを特徴とする。 The invention according to claim 18 is the identification card photographing method according to claim 11, further comprising OCR means for reading the items described in the identification card, wherein the OCR means is different from the imaging device. It is implemented in hardware.

請求項１９に記載の発明は、請求項１１記載の身分証明書撮影方法において、あおり補正手段は、写真検出手段により得られた写真領域に応じて、あおり補正を行うことを特徴とする。 According to a nineteenth aspect of the present invention, in the identification card photographing method according to the eleventh aspect, the tilt correcting means corrects the tilt according to the photo area obtained by the photo detecting means.

請求項２０に記載の発明は、請求項１１記載の身分証明書撮影方法において、撮影時に表示手段へ、身分証名書の概形を表示し、該概形に合わせて撮影することをユーザに促すことを特徴とする。 According to a twenty-second aspect of the present invention, in the identification card photographing method according to the eleventh aspect, the outline of the identification card is displayed on the display means at the time of photographing, and the user is allowed to photograph according to the rough shape. It is characterized by prompting.

請求項２１に記載の発明は、コンピュータに、身分証明書を撮影する撮像装置により得られた画像を取得し、画像に顔が含まれるか否かを判定する顔検出手段により顔が検出された場合に、顔が含まれる写真領域を検出する写真検出手段による写真領域の検出を行い、写真検出手段により得られた写真の大きさを用いて、あおり補正後の画像のアスペクト比を補正する処理を実行させることを特徴とするプログラムである。 In the invention according to claim 21, a face is detected by face detection means for acquiring an image obtained by an imaging device for photographing an identification card in a computer and determining whether or not the image includes a face. In this case, a process for detecting a photo area by a photo detection unit that detects a photo area including a face, and correcting the aspect ratio of the image after tilt correction using the size of the photo obtained by the photo detection unit. It is a program characterized by executing.

請求項２２に記載の発明は、請求項２１記載のプログラムにおいて、あおり補正後の画像に対して、顔検出処理を施すことを特徴とする。 According to a twenty-second aspect of the present invention, in the program according to the twenty-first aspect, face detection processing is performed on the image after tilt correction.

請求項２３に記載の発明は、請求項２１記載のプログラムにおいて、あおり補正前の画像に対して、顔検出処理を施すことを特徴とする。 According to a twenty-third aspect of the invention, in the program according to the twenty-first aspect, face detection processing is performed on an image before tilt correction.

請求項２４に記載の発明は、請求項２２もしくは２３記載のプログラムにおいて、顔検出手段により、撮像装置に撮影される画像にて、概ね所定の方向を向いているときにのみ顔の検出が行えるようになっていることを特徴とする。 According to a twenty-fourth aspect of the present invention, in the program according to the twenty-second or twenty-third aspect, the face can be detected only when the face detection unit is facing a predetermined direction in the image photographed by the imaging device. It is characterized by that.

請求項２５に記載の発明は、請求項２４記載のプログラムにおいて、さらに検出した顔写真領域の候補を表示し、候補が複数存在する場合には、表示手段に複数の結果を表示し、ユーザに選択させることを特徴とする。 According to a twenty-fifth aspect of the present invention, in the program according to the twenty-fourth aspect, the detected face photograph area candidates are further displayed, and when there are a plurality of candidates, a plurality of results are displayed on the display means, and It is made to select.

請求項２６に記載の発明は、請求項２５記載のプログラムにおいて、写真領域の大きさと位置に応じて、身分証明書の記載事項読み取りを開始するか否かを決定することを特徴とする。 According to a twenty-sixth aspect of the present invention, the program according to the twenty-fifth aspect is characterized in that it is determined whether or not to start reading the items described in the identification card according to the size and position of the photo area.

請求項２７に記載の発明は、請求項２１記載のプログラムにおいて、あおり補正手段により、写真検出手段で得られた写真領域に応じて、あおり補正を行うことを特徴とする。 According to a twenty-seventh aspect of the invention, in the program according to the twenty-first aspect, the tilt correction unit performs the tilt correction according to the photographic area obtained by the photo detection unit.

請求項２８に記載の発明は、請求項２１記載のプログラムにおいて、撮影時に表示手段へ、身分証名書の概形を表示し、該概形に合わせて撮影することをユーザに促すことを特徴とする。 The invention according to claim 28 is the program according to claim 21, wherein the outline of the identification card is displayed on the display means at the time of photographing, and the user is encouraged to photograph according to the outline. And

身分証明書を撮影した画像の、あおり歪みを正確に、しかも自動的に補正できるようにすることで、身分証明書に記載された内容を好適に認識させることができる。 By making it possible to correct the tilt distortion of the image obtained by photographing the identification card accurately and automatically, the contents described in the identification card can be suitably recognized.

本発明の第一の実施形態に係るユーザから見たシステムの動作を説明する図である。It is a figure explaining operation | movement of the system seen from the user which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る送信フォームについて説明する図である。It is a figure explaining the transmission form which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る画像領域を示す図である。It is a figure which shows the image area | region which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る該アプリケーションの動作を示すフローチャートである。It is a flowchart which shows operation | movement of this application which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る顔検出処理により検出される顔の領域の図である。It is a figure of the area | region of the face detected by the face detection process which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る写真領域の位置とサイズが特定される図である。It is a figure by which the position and size of the photograph area | region which concern on 1st embodiment of this invention are specified. 本発明の第一の実施形態に係るフィルタ群である。It is a filter group concerning a first embodiment of the present invention. 本発明の第一の実施形態に係る正解画像について説明する図である。It is a figure explaining the correct image which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係るAdaboostによる分類規則の生成手法のフローチャートを示した図である。It is the figure which showed the flowchart of the production | generation method of the classification rule by Adaboost which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係るあおり補正について説明する図である。It is a figure explaining tilt correction concerning a first embodiment of the present invention. 本発明の第一の実施形態に係る画像全体にフィルタを施すことについて説明する図である。It is a figure explaining applying a filter to the whole picture concerning a first embodiment of the present invention. 本発明の第一の実施形態に係る実際に多数の曲線を描いた図である。It is the figure which actually drawn many curves concerning a first embodiment of the present invention. 本発明の第一の実施形態に係る補正後の４点の算出方法の図である。It is a figure of the calculation method of four points after amendment concerning a first embodiment of the present invention. 本発明の第一の実施形態に係る写真領域の特定を示したフローチャートである。It is the flowchart which showed specification of the photograph area | region which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る左辺の検出について説明する図である。It is a figure explaining the detection of the left side which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る探索範囲について説明する図である。It is a figure explaining the search range which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係るサーバの具備する機能を説明する図である。It is a figure explaining the function which the server concerning a first embodiment of the present invention has. 本発明の第一の実施形態に係る免許証読み取り部の動作を示したフローチャートである。It is the flowchart which showed operation | movement of the license reading part which concerns on 1st embodiment of this invention. 本発明の第一の実施形態に係る矩形領域を示した図である。It is the figure which showed the rectangular area which concerns on 1st embodiment of this invention. 本発明の第二の実施形態に係る運転免許証アプリケーションの動作を示すフローチャートである。It is a flowchart which shows operation | movement of the driver's license application which concerns on 2nd embodiment of this invention. 本発明の第二の実施形態に係る写真領域特定処理のフローチャートである。It is a flowchart of the photographic area | region specific process which concerns on 2nd embodiment of this invention. 本発明の第三の実施形態に係る画像全体にエッジ検出処理、Hough変換、交点検出処理を行うことについて説明する図である。It is a figure explaining performing edge detection processing, Hough transformation, and intersection detection processing to the whole picture concerning a third embodiment of the present invention. 従来のあおり補正を説明する図である。It is a figure explaining the conventional tilt correction. 従来のあおり補正を説明する図である。It is a figure explaining the conventional tilt correction. 従来の撮影角度によって長方形の辺の長さが変化してしまう仕組みを説明する図である。It is a figure explaining the mechanism in which the length of the side of a rectangle changes with the conventional photography angles.

以下、本発明の好適な実施の形態として、クライアントサーバ型のインターネットショッピングに利用できる身分証明書撮影システムを例に挙げる。 Hereinafter, as a preferred embodiment of the present invention, an identification card photographing system that can be used for client-server type Internet shopping will be described as an example.

第１の実施の形態
ユーザから見たシステムの動作
まず図１を元に、ユーザから見たシステムの動作を説明する。尚、前提は以下の通りである。１．ユーザのPCはインターネットに接続しており、インターネットを通じてサーバと通信できる。２．ユーザは携帯電話を所持している。３．該携帯電話は無線でインターネットを通じてサーバと通信できる。４．該携帯電話にはカメラが付随している。５．該携帯電話はQRコードから情報を抽出できる。６．該携帯電話は無線でインターネットを通じてメールを受信することができる。 First Embodiment Operation of System Viewed from User First, the operation of the system viewed from the user will be described with reference to FIG. The premise is as follows. 1. The user's PC is connected to the Internet and can communicate with the server through the Internet. 2. The user has a mobile phone. 3. The mobile phone can communicate with the server wirelessly over the Internet. 4). The mobile phone is accompanied by a camera. 5. The mobile phone can extract information from the QR code. 6). The mobile phone can receive mail over the Internet wirelessly.

まず、ユーザはPCをインターネットに接続し、webページ上で様々な検索を行う(step001)。所望の商品が見つかった場合に、webページ上に配されたボタンをマウスクリックにより押下する。すると、webページからは該商品の購入決定の通知がサーバへと送信される(step002)。サーバへと通知された商品が成人にしか買えない商品である場合(step003)、サーバからは成人認証の必要性がPCを通じて通知されるとともに、webページ上にQRコードが表示される(step004)。表示されたQRコードをユーザが自分の携帯電話で撮影すると(Step005)、携帯電話はQRコードを認識し、QRコードに埋め込まれたアドレスを検知する。検知したアドレスはユーザの携帯電話のディスプレイに表示され、ユーザが該アドレスをクリックすると、携帯電話はサーバにアプリケーションの受信要求を行う。 First, a user connects a PC to the Internet and performs various searches on a web page (step 001). When a desired product is found, a button arranged on the web page is pressed by a mouse click. Then, a notification of purchase decision of the product is transmitted from the web page to the server (step 002). If the product notified to the server is a product that can only be purchased by adults (step 003), the server will notify the necessity of adult authentication through the PC and a QR code will be displayed on the web page (step 004) . When the user captures the displayed QR code with his / her mobile phone (Step 005), the mobile phone recognizes the QR code and detects the address embedded in the QR code. The detected address is displayed on the display of the user's mobile phone, and when the user clicks on the address, the mobile phone requests the server to receive an application.

受信要求を受けたサーバから携帯電話へ成人認証のための運転免許証撮影アプリケーションがダウンロードされる(step006)。ユーザが該アプリケーションを起動すると、運転免許証の撮影が促される。それに従い、ユーザは自らの運転免許証を撮影する(step007)。尚、該アプリケーションを用いた撮影については、後述する。 A driver's license photographing application for adult authentication is downloaded from the server that has received the reception request to the mobile phone (step 006). When the user starts the application, the user is prompted to take a driver's license. Accordingly, the user photographs his driver's license (step 007). Note that shooting using the application will be described later.

ユーザによって撮影された運転免許証の画像はサーバに送信される(step008)。該運転免許証画像に基づいてサーバはユーザが成人であるか否かを判定する。サーバは、成人認証の結果が、成人でない場合、該商品を購入できないという内容の記載されたアラートをユーザの携帯電話にメールで送信する(step011)。この場合、ユーザは買い物を続けることができず買い物は終了する。 The driver's license image taken by the user is transmitted to the server (step 008). Based on the driver's license image, the server determines whether the user is an adult. If the result of the adult authentication is not an adult, the server sends an alert describing that the product cannot be purchased to the user's mobile phone by e-mail (step 011). In this case, the user cannot continue shopping and the shopping ends.

一方、成人であると判定した場合、サーバは、ユーザの携帯電話に商品を購入するための住所や名前などといった情報を入力するためのフォームと送信ボタンが記入されたhtmlメールを送信する（図２参照）。 On the other hand, if it is determined that the user is an adult, the server transmits an html mail in which a form for inputting information such as an address and a name for purchasing the product and a send button are filled in the user's mobile phone (see FIG. 2).

該メールを受信(step010)したユーザは、フォームに必要事項を記入し(step012)、送信ボタンを押すことでサーバに情報を送信する(step013)。最後に、商品を提供する側は、該フォームに入力された内容に基づいてユーザに代金を請求するとともに、ユーザへ商品を発送することで買い物は終了する。 The user who has received the mail (step 010) fills in the necessary information on the form (step 012), and transmits information to the server by pressing the send button (step 013). Finally, the merchandise providing side charges the user based on the contents entered in the form, and the merchandise is shipped to the user, thereby completing the shopping.

尚、step003において年齢認証が必要ないと判定された場合、サーバは、フォームの添付されたメールをユーザに送付する(step010)が、年齢認証が必要な場合とは異なり、該htmlはユーザが閲覧中のwebページに送信されるものとする。これにより年齢認証が必要ない場合に携帯電話を利用しなくて良いのでユーザの利便性が向上する。 If it is determined in step 003 that age verification is not required, the server sends an email with a form attached to the user (step 010). Unlike the case where age verification is required, the html is viewed by the user. It is sent to the web page inside. Thereby, when age verification is not required, the user's convenience is improved because it is not necessary to use a mobile phone.

運転免許証撮影アプリケーションの動作
次に前述の運転免許証撮影アプリケーションを用いてユーザが免許証を撮影する際の動作について説明する。図４は該アプリケーションの動作を示すフローチャートである。ユーザが撮影を開始すると、携帯電話に付属しているカメラによって画像の取得がなされる(step100)。尚、撮影が開始された場合に、カメラが自動的にマクロモードになることが望ましい。マクロモードとは、近接した物体にピントが合いやすくするモードである。免許証の記載内容を正確に読み取るためには、比較的近い位置に有る免許証を、ピントを合わせながら撮影する必要がある。マクロモードが搭載された携帯電話のデジタルカメラは一般的であるが、ユーザは概してマクロモードの存在を知らない。そのため、マクロモードの設定をアプリケーションが自動的に設定することが望ましい。 Operation of Driver's License Shooting Application Next, an operation when a user photographs a driver's license using the above-described driver's license shooting application will be described. FIG. 4 is a flowchart showing the operation of the application. When the user starts photographing, an image is acquired by a camera attached to the mobile phone (step 100). It is desirable that the camera automatically enters the macro mode when shooting is started. The macro mode is a mode that makes it easy to focus on a close object. In order to accurately read the contents of the license, it is necessary to photograph the license at a relatively close position while focusing. Although the digital camera of the mobile phone equipped with the macro mode is common, the user generally does not know the existence of the macro mode. Therefore, it is desirable that the application automatically sets the macro mode setting.

該アプリケーションは取得した画像に対して、顔検出処理を行う(step101)。尚、顔検出処理の詳しい手段については後述する。該アプリケーションは顔が無かった場合(step102)、画像取得に戻る。一方、顔が存在していた場合（step102）、続いて顔が所定の領域に含まれるか否かを該アプリケーションが判定する。所定の領域とは、画像全体において顔の中心位置が右1/3に位置しており、更に顔の中心位置が画像の上下方向において中心1/3に含まれている領域とする（図３の斜線領域）。 The application performs face detection processing on the acquired image (step 101). Detailed means of the face detection process will be described later. If there is no face (step 102), the application returns to image acquisition. On the other hand, when the face exists (step 102), the application determines whether or not the face is included in a predetermined area. The predetermined area is an area in which the center position of the face is located on the right 1/3 in the entire image and the center position of the face is included in the center 1/3 in the vertical direction of the image (FIG. 3). Shaded area).

更に、該アプリケーションでは検出された顔領域のサイズが、画面全体の3％以上10％以下を占めるかを判定する。これら全てが満たされた場合に、該アプリケーションは写真領域特定のステップ(step104)に進む。それらの一つでも満たされない場合には、該アプリケーションは画像取得(step100)に戻る。 Further, the application determines whether the size of the detected face area occupies 3% or more and 10% or less of the entire screen. If all of these are satisfied, the application proceeds to the photo area identification step (step 104). If one of them is not satisfied, the application returns to image acquisition (step 100).

ここで位置の検出された顔の位置を判定するのは、顔の誤検出を防ぐためである。また、成人認証を行うために、必要な領域が画面に全て収まっていることを判定する目的もある。尚、これらの数値は経験的に決定したものであり、特に理論的な裏づけが有るわけではない。 The reason for determining the position of the face where the position is detected is to prevent erroneous detection of the face. Another purpose is to determine that all necessary areas fit on the screen for adult authentication. These numerical values are determined empirically and have no theoretical support.

ここで所定の領域であるかの判定手段を満たさない原因を該アプリケーションが携帯電話の画面を通じてユーザに通知すると、ユーザはそれに合わせて撮影条件を調整できるので、よりスムーズに撮影を行うことが可能になる。 If the application notifies the user of the reason why the predetermined area is not satisfied through the screen of the mobile phone, the user can adjust the shooting conditions accordingly, so that shooting can be performed more smoothly. become.

続いて、撮影された免許証画像に対して該アプリケーションはあおり補正(step104)を行う。あおり補正は前述の通り、図２３で示したような台形に歪んでしまった長方形を、元の長方形に戻す処理である。処理の詳細については後述するが、発明の背景でも記載したとおり、ここで元の長方形に対して正確なアスペクト比を持つ長方形画像へと補正することは不可能であるため、アスペクト比が不正確な画像へと補正される。 Subsequently, the application performs tilt correction (step 104) on the photographed license image. As described above, the tilt correction is a process of returning a rectangle that has been distorted into a trapezoid as shown in FIG. 23 to the original rectangle. Details of the processing will be described later, but as described in the background of the invention, it is impossible to correct the rectangular image having an accurate aspect ratio with respect to the original rectangle, so the aspect ratio is inaccurate. To correct images.

あおり補正に続いて、該アプリケーションは撮影された運転免許証の写真領域を特定する（step104）。一般に後述する顔検出処理により検出される顔の領域は図５の太線で示すように、写真全体ではなく、顔の領域のみになってしまう場合が多い。そのため、本実施形態においては、該アプリケーションは顔の領域だけでなく、写真領域の特定を行う。尚、本発明における写真領域とは、写真により得られた画像が記載されている領域のことを示す。実際には写真が貼られている領域でもよいし、写真を印刷した領域でもよい。また、写真領域の特定の手段については後述するが、写真領域の候補は複数検出される場合が有る。この場合には該アプリケーションは携帯電話に付随するディスプレイを用いて候補を全て表示し、ユーザに選択させる。これにより顔領域特定の間違いを大幅に減らすことができる。 Following the tilt correction, the application identifies the photographed area of the driver's license that was shot (step 104). In general, the face area detected by the face detection process described later is not the entire picture but only the face area as shown by the thick line in FIG. Therefore, in this embodiment, the application specifies not only the face area but also the photo area. In addition, the photographic area | region in this invention shows the area | region where the image obtained by the photograph is described. Actually, it may be an area where a photograph is pasted or an area where a photograph is printed. In addition, although a specific means for photographic area will be described later, a plurality of photographic area candidates may be detected. In this case, the application displays all candidates using a display attached to the mobile phone, and allows the user to select. As a result, mistakes in specifying the face area can be greatly reduced.

Step105によって該アプリケーションが写真領域を特定すると、免許証画像全体において、写真領域の位置とサイズが特定されるが、（図６参照）step106では得られたw,とhの比に応じて画面全体の縦横比を該アプリケーションが変化させる。日本における免許証の写真はw:h=24:30である。これに合わせて、画像の縦方向を拡大縮小する。 When the application identifies the photograph area in Step 105, the position and size of the photograph area are identified in the entire license image (see FIG. 6). In Step 106, the entire screen is displayed according to the ratio of w and h obtained. The application changes the aspect ratio of. The photo of the license in Japan is w: h = 24: 30. In accordance with this, the vertical direction of the image is enlarged or reduced.

アスペクト比補正が行われた画像を該アプリケーションがサーバに送信(step0107)することにより、該アプリケーションの処理は完了する。 When the application transmits the image subjected to the aspect ratio correction to the server (step 0107), the processing of the application is completed.

顔検出手段(図４-step101)
顔検出については文献１で紹介されている手法を用いるものとする。今回は非特許文献１記載の手法を用いたが、もちろん、固有顔法などといった手法を用いても構わないし、単純なパタンマッチを用いても構わない。顔が検出できる手法であれば何でも良い。
非特許文献１
Rapid Object Detection using a Boosted Cascade of Simple Features. IEEE CVPR, 2001.
その詳細は非特許文献２に譲るが、ここでは、その概要について述べる。非特許文献１で紹介される手法のエッセンスは、Rectangle Filterと呼ばれるフィルタ及び、Adaboost(非特許文献２)と呼ばれる学習手段であると言える。
非特許文献２
The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer Series in Statistics. 2001.
Rectangle Filterとは図７に示す様な、フィルタ群である。 Face detection means (Fig.4-step101)
For the face detection, the method introduced in Document 1 is used. This time, the method described in Non-Patent Document 1 is used, but of course, a method such as the eigenface method may be used, or a simple pattern match may be used. Any technique can be used as long as it can detect a face.
Non-patent document 1
Rapid Object Detection using a Boosted Cascade of Simple Features.IEEE CVPR, 2001.
The details are given in Non-Patent Document 2, but here, an outline thereof will be described. The essence of the technique introduced in Non-Patent Document 1 can be said to be a filter called Rectangle Filter and a learning means called Adaboost (Non-Patent Document 2).
Non-Patent Document 2
The Elements of Statistical Learning: Data Mining, Inference, and Prediction.Springer Series in Statistics.2001.
The Rectangle Filter is a group of filters as shown in FIG.

例えば図７の1.Edge Features(a)を画像に適用することを考える。フィルタをスキャンして、白い矩形に含まれた領域の画素の平均値と、黒い矩形に含まれる領域の画素の平均値との差を以て、フィルタ中心部の出力値とする。これにより特定のエッジパタンとの一致度を算出できる。 For example, consider applying 1. Edge Features (a) in FIG. 7 to an image. The filter is scanned, and the difference between the average value of the pixels in the area included in the white rectangle and the average value of the pixels in the area included in the black rectangle is used as the output value of the filter center. Thereby, the degree of coincidence with a specific edge pattern can be calculated.

ここでRectangle Filterのように単純な形のフィルタを用いる理由はIntegral Imageと呼ばれる高速化が利用できるからである。Integral Imageについて詳しくは触れないが、本来ならば、各矩形に含まれる画素の平均値を算出するのに（領域に含まれる画素数）回の和算が必要であるのに対し、Integral Imageを利用することによって、3回の和算を行うだけで前記平均値を算出できるため非常に高速な計算が可能である。これらの計算を図で挙げた全てのフィルタを様々な大きさに変化させながら行う。その結果、入力画像の各点において、多次元のベクトルが得られることになる。 The reason why a simple filter such as Rectangle Filter is used here is that an acceleration called Integral Image can be used. I won't go into detail about Integral Image, but originally, to calculate the average value of the pixels included in each rectangle (the number of pixels included in the region) times is required, whereas Integral Image By using the average value, the average value can be calculated only by performing the summation three times, so that a very high speed calculation is possible. These calculations are performed while changing all the filters listed in the figure to various sizes. As a result, a multidimensional vector is obtained at each point of the input image.

続いて、入力画像における、様々な大きさ、様々な位置の矩形を仮定し、それぞれの矩形に含まれるベクトルの値から、顔が存在するか否かを判定する。勿論の事ながら、その判定は非常に大きな次元のベクトルを扱うことになり、人間の手によって、判定手段を作成することは非常に困難である。そこで、正解か不正解かを二値的に判定する識別器を作成する、Adaboostと呼ばれる手法を用いる。 Subsequently, rectangles of various sizes and positions in the input image are assumed, and whether or not a face exists is determined from the vector values included in each rectangle. Of course, the determination involves handling a very large dimensional vector, and it is very difficult to create a determination means by human hands. Therefore, a technique called Adaboost, which creates a discriminator that binaryly determines whether the answer is correct or incorrect, is used.

Adaboostに対して様々なベクトルと正解値を学習データとして与えて学習させると、入力ベクトルが正解か不正解かを判定する識別器が得られる。本実施の形態における学習データにおいて上記rectangle filterの出力値がベクトルとなり、顔である画像が正解、顔以外の画像が不正解になる。 When various vectors and correct answer values are given as learning data for learning to Adaboost, a discriminator for determining whether the input vector is correct or incorrect is obtained. In the learning data in the present embodiment, the output value of the rectangle filter is a vector, and an image that is a face is correct and an image other than the face is incorrect.

ここで、正解画像として導入する顔画像について補足しておく。正解画像として、図８における(a)〜(d)のような様々な方向を向いた顔画像を与えておくと、作成された顔検出手段は、(a)〜(d)の様々なパターンを識別できるものになる。逆に(a)のパターンだけを正解画像として与え、(b)〜(d)を不正解画像として与えると、(a)のようなパターンの顔だけを検出できる。 Here, a supplementary description will be given of a face image to be introduced as a correct image. If face images in various directions such as (a) to (d) in FIG. 8 are given as correct images, the created face detection means has various patterns (a) to (d). Can be identified. Conversely, if only the pattern (a) is given as a correct image and (b) to (d) are given as incorrect images, only the face having the pattern as shown in (a) can be detected.

ここで、本実施の形態においては後者の(a)のパターンだけを正解データとして与えた。(a)の方向を向いた顔だけが検出できるので、携帯電話による撮影時に、撮影画像において運転免許証がどの方向を向いているかを簡易に判定できるからである。これによりサーバの負荷が減るだけでなく、サーバ側で運転免許証の記載内容を読み取る際に精度が良くなる。
次に、Adaboostの動作原理について述べる。 Here, in the present embodiment, only the latter pattern (a) is given as correct answer data. This is because only the face facing in the direction of (a) can be detected, so it is possible to easily determine which direction the driver's license is facing in the photographed image when photographing with the mobile phone. This not only reduces the load on the server, but also improves the accuracy when reading the description of the driver's license on the server side.
Next, the operation principle of Adaboost will be described.

Adaboost
Adaboostは分類規則の生成手段である。複数の貧弱な識別器の結果の重み付き多数決を採ることで高精度な判定結果を得ることができる。尚、ここでいう貧弱な識別器（弱識別器）と言う言葉は、それらを組み合わせて得られる高精度な識別器を強識別器と呼ぶのと対比するための言葉で、実際にはどんな識別器であっても構わない。更に言うなら、本実施の形態における顔検出手段は、学習により２値的な分類手段を得られる手法であれば何でも良い。例えばSVMやニューラルネットワークなどを用いても構わない。 Adaboost
Adaboost is a means for generating classification rules. A highly accurate determination result can be obtained by taking a weighted majority of the results of a plurality of poor classifiers. The term “poor classifier” (weak classifier) here is used to compare a high-precision classifier obtained by combining them with a strong classifier. It may be a vessel. In other words, the face detection means in this embodiment may be anything as long as it can obtain a binary classification means by learning. For example, SVM or neural network may be used.

まず、Adaboostによる分類規則の生成手法について述べる。図９はAdaboostによる分類規則の生成手法のフローチャートを示した図である。 First, a method for generating classification rules using Adaboost will be described. FIG. 9 is a diagram showing a flowchart of a classification rule generation method by Adaboost.

まずStep01ではサンプルに対する重みD₁(i)を初期化する。初期化の方法は、全ての重みの和が１になってさえいれば、自由であるが、今回はサンプル数ｍに対して各サンプルに1/mの重みを付けることとした。次にStep02では重みの付けられたサンプルを用いて弱識別器h_t(x_i)の学習を行う。非特許文献２では弱識別器の規定は無く、識別率が０．５を超えてさえいれば良いとされている。今回用いた弱識別器は特徴量次元から、或る特定の１次元を抽出し、閾値を設定する弱識別器を用いた。前述の様に特徴量は全て０〜１に正規化されているので、0. 01ステップなどで順次閾値をスキャンすれば最適な弱識別器が設定できる。 First, in Step 01, the weight D ₁ (i) for the sample is initialized. The initialization method is arbitrary as long as the sum of all the weights is 1, but this time, each sample is given a weight of 1 / m with respect to the number of samples m. Next, in Step 02, the weak classifier h _t (x _i ) is learned using the weighted samples. In Non-Patent Document 2, there is no definition of a weak classifier, and it is only necessary that the classification rate exceeds 0.5. The weak classifier used this time is a weak classifier that extracts a specific one dimension from the feature quantity dimension and sets a threshold value. As described above, since all the feature values are normalized to 0 to 1, an optimum weak classifier can be set by sequentially scanning the threshold value in 0.01 steps or the like.

続いてStep03では学習により得られた弱識別器により、全サンプルに対する識別を行い、間違い率ε_tを算出する。但し、この間違い率は間違えたサンプル数の割合でなく、間違えたサンプルに付けられた重みの和の割合である。前述の通り、重みは合計１に正規化されているので、間違えたサンプルに付けられた重みの和を採ることで、間違い率が算出できる。続いてStep04では前記間違い率ε_tを用いて、数２に基づいて弱識別器の重みαtを算出する。数２は優秀な弱識別器には高い重みを付け、劣等な弱識別器には低い重みが付くようになっている。 Subsequently, in Step 03, all samples are identified by the weak classifier obtained by learning, and the error rate ε _t is calculated. However, this error rate is not the ratio of the number of mistaken samples, but the percentage of the sum of the weights assigned to the wrong samples. As described above, since the weights are normalized to 1 in total, the error rate can be calculated by taking the sum of the weights attached to the wrong samples. Then using Step04 In the mistake rate epsilon _t, we calculate the weights αt of the weak classifiers based on the number 2. In Formula 2, an excellent weak classifier is given a high weight, and an inferior weak classifier is given a low weight.

続いてStep05では前記弱識別器の重みα_iを用いて、数３、数４に基づいてサンプルの重みを更新する。数３は今回の学習で間違えたサンプルの重みを重く、正解したサンプルの重みを小さくするように設計されている。尚、Ztは更新後の重みの和が１になるように正規化するための値である。 Subsequently, in Step 05, the weights of the samples are updated based on Equations 3 and 4 using the weights α _i of the weak classifiers. Equation 3 is designed to increase the weight of the sample mistaken in the current learning and decrease the weight of the correct sample. Zt is a value for normalization so that the sum of the updated weights becomes 1.

最後にStep06では収束確認を行う。例えばエラー率が０になった場合には収束したと言えるし、エラー率の変化が事前に設定した閾値（例えば1e-6等）の範囲を超えない場合にも収束したと言える。もし、収束しなかった場合にはもう一度Step02〜Step06迄の処理を繰り返す。もし、収束した場合には、処理を終了させる。 Finally, at Step 06, convergence is confirmed. For example, when the error rate becomes 0, it can be said that it has converged, and it can also be said that it has converged when the change in the error rate does not exceed a preset threshold (for example, 1e-6). If not converged, the process from Step 02 to Step 06 is repeated once again. If converged, the process is terminated.

最終的に算出された、複数の弱識別器h_i(x)と弱識別器の重みα_iを利用し、数式５に基づいて計算を行えば識別器として利用することが出来る。但し、数式５においてTは収束までに学習した回数、つまり識別に利用する弱識別器の数を示している。 If a plurality of weak classifiers h _i (x) and weak classifier weights α _i finally calculated are used and calculation is performed based on Equation 5, the classifiers can be used. In Equation 5, T represents the number of times of learning until convergence, that is, the number of weak classifiers used for identification.

あおり補正(図４-step104)
続いて図４-step104で示したあおり補正について述べる。あおり補正は図１０に示すようなフローによって達成される。 Tilt correction (Figure 4-step104)
Next, the tilt correction shown in FIG. 4-step 104 will be described. The tilt correction is achieved by a flow as shown in FIG.

まずエッジ検出手段300では、画像全体に図１１で示すようなフィルタを施し、画像からエッジ成分を算出し、更に二値化処理することによってエッジの二値画像を得る。以下エッジである点をエッジ点と呼ぶことにする。 First, the edge detection means 300 applies a filter as shown in FIG. 11 to the entire image, calculates an edge component from the image, and further performs binarization processing to obtain a binary image of the edge. Hereinafter, a point that is an edge will be referred to as an edge point.

続いて、直線検出手段301は、画像に含まれる直線を検出する。ここでは一般的な直線検出アルゴリズムであるHough変換を利用するものとする。尚、ここではHough変換を利用したが、直線が検出できるアルゴリズムならば何でも構わない。Hough変換ではエッジ点の座標x, yを以下の式に代入してrθ平面状での曲線を得る。 Subsequently, the straight line detection unit 301 detects a straight line included in the image. Here, it is assumed that Hough transform, which is a general straight line detection algorithm, is used. Although the Hough transform is used here, any algorithm that can detect a straight line may be used. In the Hough transform, an edge point coordinate x, y is substituted into the following equation to obtain a curve in the rθ plane.

全てのエッジ点の座標を数６に代入して、rθ平面状で曲線を描くと、それらの曲線が多数交わる点が現れる。図１２は、実際に多数の曲線を描いたものである。ここで、白い線が各曲線を表しており、明るい点ほど多数の曲線が交わる点であるといえる。 When the coordinates of all the edge points are substituted into Equation 6 and a curve is drawn on the rθ plane, a point where many of these curves intersect will appear. FIG. 12 actually depicts a large number of curves. Here, the white line represents each curve, and it can be said that a brighter point is a point where many curves intersect.

この明るい点を示すｒとθを以下の数７に代入すると所望の直線が得られる。一方、実際には、図１２からも明らかなように明るい点ではなく、明るい領域が得られてしまうため、局所領域で最も明るい領域を求めて、直線を示すrとθを得るものとする。即ち、該rθ平面への描画の後、rθ平面において注目画素を順次スキャンし、注目画素の周辺５×５画素において、注目画素が最も白い点であったrとθにより表現される直線（以下の式参照）が直線の候補となる。 A desired straight line can be obtained by substituting r and θ indicating the bright point into the following equation (7). On the other hand, actually, as is clear from FIG. 12, a bright area is obtained instead of a bright point. Therefore, the brightest area is obtained in the local area, and r and θ indicating straight lines are obtained. That is, after drawing on the rθ plane, the target pixel is sequentially scanned on the rθ plane, and a straight line (hereinafter referred to as r and θ) where the target pixel is the whitest point in the 5 × 5 pixels around the target pixel (See the equation) is a straight line candidate.

尚、図１２からも明らかなように、直線の候補は複数現れる。そのため、交点検出手段302では、まず、検出された全ての直線候補の交点を算出する。算出された交点の中で、画像中で最も左上に有るものと、右上に有るものと、最も左下に有るものと、右下に有るものを、補正前の４点として出力する。尚、この４点は運転免許証の４隅が取れることを期待している。 As is clear from FIG. 12, a plurality of straight line candidates appear. Therefore, the intersection detection unit 302 first calculates intersections of all detected straight line candidates. Among the calculated intersections, the one at the top left, the top right, the bottom left, and the bottom right in the image are output as four points before correction. These four points are expected to get four corners of the driver's license.

続いて、補正後４点算出手段303では、前記補正前の４点の座標を元に、補正後の４点を算出する。補正後の４点の算出方法は、図１３で示すように各点の中点を通る垂直、もしくは水平な直線を４辺とする長方形の各頂点である。図中白丸で示した点が補正前の４点であり、黒丸で示した点が補正後の４点である。 Subsequently, the corrected 4-point calculation means 303 calculates the corrected 4 points based on the coordinates of the 4 points before the correction. As shown in FIG. 13, the correction method of the four points after correction is each vertex of a rectangle having four sides of a vertical or horizontal straight line passing through the middle point of each point. The points indicated by white circles in the figure are the four points before correction, and the points indicated by black circles are the four points after correction.

以降、補正前の４点の座標を(x₀, y₀)〜(x₃, y₃)とし、補正後の４点の座標を(x'₀, y'₀)〜(x'₃, y'₃)として話を続ける。 Thereafter, the coordinates of the four points before correction are (x ₀ , y ₀ ) to (x ₃ , y ₃ ), and the coordinates of the four points after correction are (x ′ ₀ , y ′ ₀ ) to (x ′ ₃ , Continue talking as y ' ₃ ).

続いてマトリクス算出手段304では、数８の８元連立方程式を解くことにより求める。 Subsequently, the matrix calculation means 304 obtains it by solving the 8-ary simultaneous equations of Equation 8.

この８元連立方程式をGauss-Jordan法等で解くと変換マトリクスを求めることができる。 The conversion matrix can be obtained by solving the quaternary simultaneous equations by the Gauss-Jordan method or the like.

ここで得られた変換マトリクスをマトリクス演算手段305で画像全体に施すことによりあおり補正が完了する。 The tilt correction is completed by applying the conversion matrix obtained here to the entire image by the matrix calculation means 305.

写真領域判定手段(図４-step105)
続いて図４のstep105で説明した写真領域特定手段について述べる。写真領域の特定は図１４に示すようなフローによって左、右、上、下の各辺を検出することによりなされる。まず左辺の検出を例に挙げて説明する。 Photo area determination means (Fig.4-step105)
Next, the photographic area specifying means described in step 105 of FIG. 4 will be described. The photo area is specified by detecting the left, right, top, and bottom sides according to the flow shown in FIG. First, the left side detection will be described as an example.

まず左辺は縦エッジなので、縦のエッジを検出する。具体的には図１５の(a)のような縦エッジに強く応答するようなフィルタを全面に施す。 First, since the left side is a vertical edge, a vertical edge is detected. Specifically, a filter that strongly responds to the vertical edge as shown in FIG.

続いて二値化処理を行って(401)、エッジ画素と非エッジ画素を確定する。エッジカウント402ではまず、検出した顔の領域を参考に、図１６の点線で示したような探索範囲を決定する。該探索範囲において、縦方向にエッジをカウントする。結果各xにおいて縦エッジ画素がどれだけ存在するかをカウントする。その後、最大カウント探索403では最も縦エッジ画素が存在するx座標の縦エッジ画素数を探索する。その後、各x座標の縦エッジ数と、最大縦エッジ数を比較し、最大エッジ数の80％以上を写真縁候補とする。同様に右辺を検出する。 Subsequently, binarization processing is performed (401) to determine edge pixels and non-edge pixels. In the edge count 402, first, a search range as indicated by a dotted line in FIG. 16 is determined with reference to the detected face area. In the search range, the edges are counted in the vertical direction. Result: Count how many vertical edge pixels exist in each x. Thereafter, in the maximum count search 403, the number of vertical edge pixels of the x coordinate where the vertical edge pixel is most present is searched. Thereafter, the number of vertical edges of each x coordinate is compared with the maximum number of vertical edges, and 80% or more of the maximum number of edges is set as a photo edge candidate. Similarly, the right side is detected.

また、上辺と下辺についてはエッジ検出のフィルタとして図１５の(b)のようなフィルタを用い、更にエッジのカウントを縦方向ではなく、横方向にすれば、同様に写真縁候補が得られる。尚、写真縁はエッジカウントが最大の点だけをピックアップするのではなく、最大の８０％を超えていれば候補点とするため、写真領域の候補は前述の通り複数現れる。 Further, for the upper side and the lower side, if a filter as shown in FIG. 15B is used as an edge detection filter and the edge count is changed to the horizontal direction instead of the vertical direction, a photographic edge candidate can be similarly obtained. Note that the photograph edge does not pick up only the point with the maximum edge count, but is a candidate point if it exceeds 80% of the maximum, so a plurality of photograph region candidates appear as described above.

サーバ側から見たシステムの動作
続いて、サーバ側から見たシステムの動作について説明する。以下の図１７は該実施の形態のシステムにおけるサーバの具備する機能を一覧にしたものである。まずサーバは通信部107を具備し、インターネットを通じて様々なユーザのPCと通信することができる。またサーバには商品の情報を格納する商品DBが存在する。該DBには表１に示すように、それぞれの商品ID、商品名、価格、画像URL、年齢認証が必要か否か、および、商品の画像が格納されている。 Operation of the system viewed from the server side Next, the operation of the system viewed from the server side will be described. FIG. 17 below lists the functions of the server in the system according to the embodiment. First, the server includes a communication unit 107 and can communicate with PCs of various users through the Internet. The server has a product DB for storing product information. As shown in Table 1, each DB stores a product ID, a product name, a price, an image URL, whether age authentication is necessary, and a product image.

ユーザがインターネットを通じて該サーバのページにアクセスすると、該サーバは商品情報送信部100を通じてユーザのPCへこれらの情報を送信する。これらの情報を閲覧したユーザが商品の検索を行った後に購入決定のボタンを押すと、サーバへは購入にいたった商品の商品IDが送信されてくる。 When the user accesses the page of the server through the Internet, the server transmits the information to the user's PC through the product information transmission unit 100. When a user who has browsed these pieces of information searches for a product and then presses a purchase decision button, the product ID of the product that has been purchased is transmitted to the server.

該サーバは、送信された商品IDを商品DB内で検索し、該商品は成人認証を必要としているか否かを年齢認証要否判定部101にて判定する。年齢認証が必要でない場合、該サーバはフォーム生成・送信部102により、フォームを生成し、通信部105とインターネットを介してユーザの閲覧しているwebページへ送信する。 The server searches the product DB for the transmitted product ID, and the age authentication necessity determination unit 101 determines whether or not the product requires adult authentication. When age authentication is not necessary, the server generates a form by the form generation / transmission unit 102 and transmits the form to the web page being browsed by the user via the communication unit 105 and the Internet.

一方、年齢認証が必要である場合には、該サーバはQRコード作成・送信部を用いてQRコードを作成しさ、ユーザが閲覧中のPCへQRコードを送信する。ここで、QRコードには該サーバに保持される運転免許証画像送信アプリケーションをダウンロードするためのアドレスが含まれているものとし、ユーザがカメラで該QRコードを撮影し、QRコードの内容を読み取ると、該アドレスに簡単にアクセスできるものとする。 On the other hand, when age verification is required, the server creates a QR code using the QR code creation / transmission unit, and transmits the QR code to the PC being browsed by the user. Here, it is assumed that the QR code includes an address for downloading the driver's license image transmission application held in the server, and the user captures the QR code with a camera and reads the content of the QR code. It is assumed that the address can be easily accessed.

ユーザが該アドレスにアクセスすると、該サーバはアプリケーション送信部105を通じて、保持している前記運転免許証画像送信アプリケーションをユーザへと送信する。 When the user accesses the address, the server transmits the held driver's license image transmission application to the user through the application transmission unit 105.

ユーザが、運転免許証画像送信アプリケーションを通じて免許証の画像を送信してくると、該サーバは免許証読み取り部106を用いて、免許証の画像からユーザの生年月日を読み取る。該免許証読み取り部106の動作については後述する。該サーバは読み取った生年月日を元にユーザの年齢を算出し、ユーザの年齢が二十歳を超えていた場合、フォーム生成・送信部102を用いて、フォームを生成し、通信部105とインターネットを介してユーザの閲覧しているwebページへ送信する。尚、後述するが、生年月日の読み取りと同様の手段で、住所や氏名の読み取りも行うことができるので、該サーバが住所や氏名を読み取って、生成するフォームに予め入力しておけば、ユーザの手間が省け、ユーザの利便性が向上する。 When the user transmits a license image through the driver's license image transmission application, the server uses the license reading unit 106 to read the date of birth of the user from the license image. The operation of the license reading unit 106 will be described later. The server calculates the user's age based on the date of birth read, and if the user's age is over 20 years, the form is generated using the form generation / transmission unit 102, and the communication unit 105 Send to the web page that the user is browsing via the Internet. As will be described later, since the address and name can also be read by the same means as reading the date of birth, if the server reads the address and name and inputs in advance to the form to be generated, This saves the user trouble and improves user convenience.

ユーザにより必要事項の記入されたフォームが送信されてきた場合、該サーバはフォーム受信部103で該フォームを受け取り、注文内容DBへと格納する。該DBを参照し、商品を提供する人間が、該フォームに入力された内容に基づいてユーザに代金を請求するとともに、ユーザへ商品を発送することで買い物は終了する。 When a form in which necessary items are filled in is transmitted by the user, the server receives the form at the form receiving unit 103 and stores it in the order content DB. The person who provides the product with reference to the DB charges the user for the price based on the contents entered in the form and sends the product to the user, thereby completing the shopping.

免許証読み取り部106
図１７で示した免許証読み取り部106の動作について図１８のフロー図を用いて説明する。 License reading section 106
The operation of the license reading unit 106 shown in FIG. 17 will be described with reference to the flowchart of FIG.

まず、領域特定手段500により免許証画像において生年月日が記載されている領域を該サーバが特定できる。既に述べたように、これまでの処理で、免許証画像中でどこに顔写真が存在するかは該サーバに対して明らかになっている。また、免許証のフォーマットは（少なくとも都道府県内では）一定である。そのため、顔写真の左上に対して、生年月日が記載される領域の左上との位置関係、および図１９に示した矩形領域の大きさは一定である。そのため、写真領域の位置から、簡単に領域を生年月日の記載された領域を該サーバが特定できる。 First, the server can specify the region where the date of birth is described in the license image by the region specifying means 500. As described above, it has been revealed to the server where the face photograph exists in the license image by the processing so far. Also, the license format is fixed (at least in prefectures). Therefore, the positional relationship between the upper left of the face photograph and the upper left of the region where the date of birth is described, and the size of the rectangular region shown in FIG. 19 are constant. For this reason, the server can easily specify the region in which the date of birth is described from the position of the photo region.

このように、簡単に領域を特定できるのは、本実施の形態で該サーバがあおり補正を行っているだけでなく、顔写真領域を用いた正確なアスペト比補正を行っているから実現できることである。もし、あおり補正を行っていない場合、アスペクト比が補正されていないことを考えると、顔写真の位置がわかっても、生年月日が記載された領域は容易にわからない。 In this way, it is possible to easily specify the area because not only the server corrects the correction in the present embodiment but also corrects the aspect ratio using the face photograph area. is there. If tilt correction is not performed, considering that the aspect ratio is not corrected, even if the position of the face photo is known, the region where the date of birth is described is not easily known.

続いて、文字切り出し手段501では、特定した領域に含まれる画像を該サーバが一文字ずつの画像に切り分ける。更に、文字照合手段502では文字DBを参照し、パタンマッチで切り出した文字それぞれが、どの文字に近いかを判定する。 Subsequently, in the character cutout unit 501, the server cuts the image included in the specified area into images of each character. Further, the character collating unit 502 refers to the character DB, and determines which character each character cut out by pattern matching is close to.

第一の実施の形態の効果
ここまで第一の実施の形態について説明してきた。従来技術で説明したとおり、本来被写体(身分証明書)との距離、および免許証とカメラとのなす角度が明らかにならない限り、正確なアスペクト比を保持したあおり補正は不可能であった。一方、本実施の形態では、定型フォーマットである免許証についてならば、定型フォーマットのサイズ情報を利用することで、正確なアスペクト比を保持したあおり補正を自動で行うことが出来ることを示した。更に、定型フォーマットの中でも、顔、および顔写真に着目したのは、免許証などに用いられる顔写真が顔検出にとって極めて好適であるためである。更にいえば、本実施の形態で示した非特許文献２記載の手法は現在、デジタルカメラなどで広く採用されている手法であり、認識精度は極めて高い。これらのことから、免許証に記載されている顔の検出は極めて高い精度で行える。
また、この構成により、運転免許証に対して正確なあおり補正を行うことができるため、顔の位置から免許証の記載事項が書かれた領域を極めて良好に特定することができる。
本発明は運転免許証に限定して説明したが、定型フォーマットかつ、顔写真が貼付されている身分証明書にならばなんでも適用可能である。 Effect of First Embodiment The first embodiment has been described so far. As described in the prior art, it is impossible to correct the tilt while maintaining an accurate aspect ratio unless the distance from the subject (identification card) and the angle between the license and the camera are unclear. On the other hand, in the present embodiment, for a license that is a fixed format, it has been shown that tilt correction with an accurate aspect ratio can be automatically performed by using the size information of the fixed format. Furthermore, among the standard formats, attention was paid to the face and the face photograph because the face photograph used for a license is extremely suitable for face detection. Furthermore, the technique described in Non-Patent Document 2 shown in the present embodiment is a technique that is currently widely used in digital cameras and the like, and the recognition accuracy is extremely high. For these reasons, the face detection described in the license can be detected with extremely high accuracy.
Also, with this configuration, it is possible to perform accurate tilt correction on the driver's license, so it is possible to very well identify the area where the description items of the license are written from the face position.
Although the present invention has been described only with respect to a driver's license, the present invention can be applied to any identification card that has a fixed format and a face photograph attached thereto.

本発明の第二の実施の形態（携帯電話で顔検出だけは行う）
第二の実施の形態で述べる身分証明書撮影システムは、ユーザ側から見たシステムの動作、サーバ側から見たシステムの動作はほぼ第一の実施の形態で述べた身分証明書撮影システムと同じであるが、顔検出や、あおり補正といった処理がサーバ側に実装されている点で異なる。それにより第一の実施の形態と異なる点である、運転免許撮影アプリケーションの動作と、免許証読み取り部の動作について述べる。 Second embodiment of the present invention (only face detection is performed with a mobile phone)
In the ID card photographing system described in the second embodiment, the system operation viewed from the user side and the system operation viewed from the server side are substantially the same as the ID card shooting system described in the first embodiment. However, the difference is that processing such as face detection and tilt correction is implemented on the server side. Accordingly, the operation of the driver's license photographing application and the operation of the license reading unit, which are different from the first embodiment, will be described.

運転免許証撮影アプリケーションの動作
図２０は運転免許証アプリケーションの動作を示すものである。ユーザが撮影を開始すると、携帯電話に付属しているカメラによって画像の取得がなされる(step200)。尚、撮影が開始された場合には、第一の実施の形態同様、該アプリケーションが自動的にカメラをマクロモードにすることが望ましい。また、本実施の形態では、ユーザが撮影した画像が成人識別に好適であるかが、全く判らないので、撮影時に該アプリケーションが携帯電話のディスプレイ上に免許証の概形を表示し、該概形に合わせて撮影することをユーザに促すと良い。こうすることで、ユーザは画面においてどの方向に免許証を置いて撮影すればよいのか、どの程度の大きさに撮影すればよいのかがわかる。すると、撮影される画像は比較的好条件のものになるので、以下の処理の精度が高まる。撮影した画像はサーバへ送信される(step201)。 Operation of Driver's License Shooting Application FIG. 20 shows the operation of the driver's license application. When the user starts shooting, an image is acquired by a camera attached to the mobile phone (step 200). When shooting is started, it is desirable that the application automatically sets the camera to the macro mode, as in the first embodiment. Further, in this embodiment, since it is not known at all whether the image taken by the user is suitable for adult identification, the application displays the outline of the license on the display of the mobile phone at the time of shooting. The user should be encouraged to shoot according to the shape. By doing so, the user can know in which direction the license should be placed on the screen and what size should be taken. Then, since the photographed image has relatively favorable conditions, the accuracy of the following processing is increased. The captured image is transmitted to the server (step 201).

免許証読み取り部の動作(図１７-106)
基本的には第一の実施の形態と同様であるが、第一の実施の形態でカメラ側のアプリケーションの処理であった顔検出(step101)からアスペクト比補正(step106)までの処理をサーバに行わせる点で異なる（図４参照）。殆どの処理は図４を用いて説明したものと同じであるが、写真領域特定(step105)だけは若干異なる。図２１は写真領域特定処理のフローチャートであるが、404の判定手段において、第一の実施の形態では、最大カウントの80%以上であるならアプリケーションが写真領域の候補とするとし、複数の候補からユーザに選択させるという構成となっていたが、本実施の形態では、最大カウントの場合に即、サーバが写真領域として決定してしまう。このような処理にしたのは、処理がサーバ上で行われ、ユーザからのインタラクティブな操作が不可能であるからである。 Operation of license reading part (Fig.17-106)
Basically the same as in the first embodiment, but the processing from face detection (step 101) to aspect ratio correction (step 106), which was the processing of the application on the camera side in the first embodiment, is performed on the server. It differs in that it is performed (see FIG. 4). Most of the processing is the same as that described with reference to FIG. 4, but only the photo region specification (step 105) is slightly different. FIG. 21 is a flowchart of the photograph area specifying process. In the determination means 404, in the first embodiment, if the maximum count is 80% or more, the application is assumed to be a photograph area candidate. Although the configuration is such that the user selects it, in this embodiment, the server immediately determines the photo area when the maximum count is reached. Such a process is performed because the process is performed on the server and an interactive operation from the user is impossible.

第二の実施の形態の効果
以上のように第二の実施の形態では、第一の実施の形態のように写真領域の候補をユーザが選択するなどといったインタラクティブな動作がない分、認識の精度は落ちるが、一般的に貧弱な処理能力しか持たない携帯電話での処理を最小限に抑えているので、高速に動作し、ユーザの利便性は高まる。 Effects of Second Embodiment As described above, in the second embodiment, recognition accuracy is reduced because there is no interactive operation such as the user selecting a photo area candidate as in the first embodiment. However, since processing on a mobile phone having generally poor processing capability is kept to a minimum, it operates at high speed and the convenience of the user is enhanced.

本発明の第三の実施の形態
第三の実施の形態で述べる身分証明書撮影システムは第一の実施の形態で述べた身分証明書撮影システムとほぼ同じ構成であるが、図４のあおり補正(step104)の動作が異なる。第一の実施の形態の図１０で示したあおり補正では、画像全体にエッジ検出処理、Hough変換、交点検出処理を行った。一方、本実施の形態では、該処理を施す範囲を図２２中の太線で囲われた範囲のみにする。尚、太点線は顔検出により検出された顔の範囲であり、処理を施す領域は、顔の範囲と重心を変えずに縦横２倍にした範囲とする。 Third Embodiment of the Present Invention The ID card photographing system described in the third embodiment has substantially the same configuration as the ID card photographing system described in the first embodiment, but the tilt correction shown in FIG. The operation of (step 104) is different. In the tilt correction shown in FIG. 10 of the first embodiment, edge detection processing, Hough conversion, and intersection detection processing are performed on the entire image. On the other hand, in the present embodiment, the range to be subjected to the processing is limited to the range surrounded by the thick line in FIG. Note that the bold dotted line is the face range detected by face detection, and the area to be processed is a range that is doubled vertically and horizontally without changing the face range and the center of gravity.

第一の実施の形態の図１０で示したあおり補正では、免許証の４隅が取れることを期待して、全面への処理を行ったが、本実施の形態では、写真領域の４隅が取れることを期待して処理を行っている。経験的に広い範囲（免許証の４隅）を見てあおり補正を行った方が、精度の良いあおり補正が行えるが、本実施の形態のように狭い範囲（写真領域の４隅）を見て、あおり補正を行ったほうが、処理量は大幅に削減できる。 In the tilt correction shown in FIG. 10 of the first embodiment, processing is performed on the entire surface in the hope that four corners of the license can be obtained. However, in this embodiment, four corners of the photographic area are displayed. Processing is performed with the expectation that it will be removed. It is better to correct the tilt by looking at a wide range (4 corners of the license) empirically, but the narrow range (4 corners of the photographic area) is seen as in this embodiment. Thus, the amount of processing can be greatly reduced by performing tilt correction.

以上、これまでの各実施の形態では、携帯電話とサーバを連携させて身分の証明を行うことを説明してきたが、各構成要件をどちらに携帯電話、サーバのどちらに担当させても構わない。例えば、それらの処理を行う機能を全て携帯電話に具備することも可能である。例えば、身分証明を行うソフトとしてユーザに提供し、そのソフトにより認証が行えた場合にのみ、所定の物品販売するといったことも実現可能である。
また、同様に全ての処理を一切サーバにやらせてしまっても問題は無い。サーバにはユーザから送られてきた身分証明書が撮影された画像が送られてきて、それに処理を行うことによって身分証明を行い、所定の物品を販売するといったことも実現可能である。 As described above, in each of the embodiments so far, the identification of the identity is performed by linking the mobile phone and the server. However, either the mobile phone or the server may be responsible for each component. . For example, the mobile phone can be provided with all functions for performing these processes. For example, it is also possible to provide a user as identification software and sell a predetermined item only when authentication is performed by the software.
Similarly, there is no problem if the server does all processing. It is also possible to send an image of the ID card sent from the user to the server, and to process the image to perform identification and sell a predetermined article.

以下、各請求項の作用効果について説明する。 Hereinafter, the function and effect of each claim will be described.

請求項１等の発明によれば、運転免許証の写真という定型であることが既知の領域を検出し、該領域の幅と高さを用いてあおり補正後のアスペクト比補正を行うことにより、正確な補正を行うことができ、免許証の内容を読み取る際の精度が向上する。 According to the invention of claim 1 and the like, by detecting a region known to be a fixed form of a photograph of a driver's license and performing the aspect ratio correction after correcting the tilt using the width and height of the region, Accurate correction can be performed, and the accuracy when reading the contents of the license is improved.

請求項２等の発明によればあおり補正後の画像に対して顔検出処理を施すため、顔検出の精度が向上し、免許証の内容を読み取る際の精度が向上する。 According to the invention of claim 2 and the like, the face detection process is performed on the tilt-corrected image, so that the accuracy of face detection is improved and the accuracy when reading the contents of the license is improved.

請求項３等の発明によれば、顔検出を行い、撮影された画像が読み取りに適するか否かを判定してから、比較的重いあおり補正を行うため、読み取りに適した画像が得られるまで、重い処理を何度も繰り返すことが無くなり、ユーザの利便性が向上する。 According to the invention of claim 3 and the like, since face detection is performed and it is determined whether or not the photographed image is suitable for reading, since relatively heavy tilt correction is performed, until an image suitable for reading is obtained. The heavy processing is not repeated many times, and the convenience for the user is improved.

請求項４等の発明に拠れば、顔検出が行える顔の向きは一定なので、ユーザが間違えた配置で撮影を行うことにより年齢の認証が行えないといった不具合が無くなり、ユーザの利便性が向上する。 According to the invention of claim 4 or the like, since the face direction in which face detection can be performed is constant, there is no problem that age authentication cannot be performed by photographing with the wrong arrangement, and convenience for the user is improved. .

請求項５等の発明によれば、複数の顔写真領域の候補からユーザに選択させることで、ユーザに負荷をかけることなく正確に顔写真領域を特定できるため正確なアスペクト比の補正ができる。 According to the invention of claim 5 and the like, by allowing the user to select from a plurality of face photo area candidates, the face photo area can be specified accurately without imposing a load on the user, so that an accurate aspect ratio can be corrected.

請求項６等の発明によれば、写真の位置と大きさが成人認証に好適な場合にだけ読み取りを行うことにより、小さい画像や、必要な領域が切れてしまった画像に対して、読み取り処理を行わないため、読み込めないために、ユーザに再度の撮影を促すなどといった処理を経ることなく読み取りが完了するため、ユーザの利便性が向上する。 According to the invention of claim 6 or the like, reading processing is performed for a small image or an image in which a necessary area is cut off by performing reading only when the position and size of the photograph are suitable for adult authentication. Therefore, the user's convenience is improved because reading is completed without performing processing such as prompting the user to take another image.

請求項７等の発明によれば、運転免許証の撮影が開始すると同時に自動的にマクロモードになるため、十分に免許証に近づいて撮影を行っても、ピントのあった画像が得られるため読み取りの精度が向上する。 According to the invention of claim 7 and the like, since the macro mode is automatically set at the same time as the shooting of the driver's license is started, a focused image can be obtained even if the shooting is sufficiently close to the license. Improves reading accuracy.

請求項８等の発明によれば、比較的重い処理であるOCR処理を、サーバなどのリッチな処理能力の有る端末で行うため、携帯電話の処理系に過度な処理能力を要求せず、ユーザにとってはシステム導入の敷居が下がる。 According to the invention of claim 8 and the like, since OCR processing, which is relatively heavy processing, is performed by a terminal having a rich processing capability such as a server, the processing system of the mobile phone is not required to have excessive processing capability, and the user For this, the threshold for system introduction is lowered.

請求項９等の発明によれば、顔検出手段によって得られた顔領域からあおり補正を行うための処理を施す範囲を大幅に小さくすることができるため、携帯電話の処理系に過度な処理能力を要求せず、ユーザにとってはシステム導入の敷居が下がる。 According to the invention of claim 9 and the like, since the range for performing the correction for the tilt correction from the face area obtained by the face detection means can be significantly reduced, the processing system of the cellular phone is excessively processed. The threshold for system introduction is lowered for the user.

請求項１０等の発明によれば、撮影時に、撮影画面に身分証明書の概形が現れ、該概形に合わせて身分証明書を撮影すればよいので、画面内で身分証明書がどう撮影されていれば良いのかが直感的に判るためユーザの利便性が向上する。 According to the invention of claim 10 or the like, the outline of the identification card appears on the shooting screen at the time of shooting, and it is only necessary to take a picture of the identification card according to the outline. The user convenience is improved because it is intuitively understood whether or not it should be done.

なお、上述する各実施の形態は、本発明の好適な実施の形態であり、本発明の要旨を逸脱しない範囲内において種々の変更実施が可能である。 Each embodiment described above is a preferred embodiment of the present invention, and various modifications can be made without departing from the gist of the present invention.

特開２００３−０３０７２６号公報JP 2003-030726 A 特開２００１−０８４３６５号公報Japanese Patent Laid-Open No. 2001-084365 特開平０９−２８９６００号公報JP 09-289600 A 特開２００１−１７７７１６号公報JP 2001-177716 A

Claims

An imaging device that shoots an identification card, a face detection unit that determines whether or not a face is included in an image, a photo detection unit that detects a photo area including a face, and a tilt correction that corrects a tilt of the entire image And when the face detection unit detects a face in the image obtained by the imaging device, the photo detection unit detects a photo area, and the photo detection unit An ID card photographing system that corrects the aspect ratio of an image after tilt correction using the size.

2. The identification card photographing system according to claim 1, wherein face detection processing is performed on the image after tilt correction.

2. The identification card photographing system according to claim 1, wherein face detection processing is performed on an image before tilt correction.

4. The identity according to claim 2, wherein the face detection means is capable of detecting a face only when the image is picked up by an image pickup apparatus and is substantially in a predetermined direction. Certificate shooting system.

5. The display device according to claim 4, further comprising display means for displaying the detected face photograph area candidates, and when there are a plurality of candidates, the display means displays a plurality of results and allows the user to select one. Identification card taking system.

6. The identification card photographing system according to claim 5, wherein it is determined whether or not reading of the description items of the identification card is started according to the size and position of the photo area.

The identification image capturing system according to claim 1, wherein the image capturing apparatus automatically enters a macro mode during image capturing.

2. The identification card according to claim 1, further comprising an OCR unit that reads items described in the identification card, wherein the OCR unit is mounted on hardware different from the imaging device. Shooting system.

2. The identification card photographing system according to claim 1, wherein the tilt correction means corrects the tilt according to the photograph area obtained by the photograph detection means.

2. The identification card photographing system according to claim 1, wherein the outline of the identification card is displayed on the display means at the time of photographing, and the user is encouraged to photograph according to the outline.

An imaging device that shoots an identification card, a face detection unit that determines whether or not a face is included in an image, a photo detection unit that detects a photo area including a face, and a tilt correction that corrects a tilt of the entire image An identification image photographing method in an identification image photographing system, wherein a face is detected by the photo detection means when a face is detected by the face detection means in an image obtained by the imaging device. A method of taking an identification card, comprising: detecting and correcting an aspect ratio of an image after tilt correction using a size of a photograph obtained by the photograph detecting means.

The identification image capturing method according to claim 11, wherein face detection processing is performed on the image after tilt correction.

The identification image capturing method according to claim 11, wherein face detection processing is performed on an image before tilt correction.

14. The identity according to claim 12, wherein the face detection means is capable of detecting a face only when the image is picked up by an image pickup apparatus and is substantially in a predetermined direction. Certificate shooting method.

The identification card photographing system further includes a display unit, and displays the detected face photo area candidates. When there are a plurality of candidates, the display unit displays a plurality of results and allows the user to select one. 15. The identification card photographing method according to claim 14.

16. The method of photographing an identification card according to claim 15, wherein it is determined whether or not to start reading the identification item according to the size and position of the photo area.

The identification image capturing method according to claim 11, wherein the image capturing apparatus automatically enters a macro mode at the time of capturing.

12. The identification card according to claim 11, further comprising an OCR unit that reads items described in the identification card, and the OCR unit is mounted on hardware different from the imaging device. Shooting method.

12. The method of photographing an identification card according to claim 11, wherein the tilt correcting means corrects the tilt according to the photograph area obtained by the photograph detecting means.

12. The method of photographing an identification card according to claim 11, wherein the outline of the identification card is displayed on the display means at the time of photographing, and the user is prompted to photograph according to the outline.

When a face is detected by a face detection unit that acquires an image obtained by an imaging device that shoots an identification card in a computer and determines whether or not a face is included in the image, a photo area including the face Detecting a photograph area by a photograph detecting means for detecting the image, and executing a process for correcting the aspect ratio of the image after tilt correction using the size of the photograph obtained by the photograph detecting means. program.

The program according to claim 21, wherein face detection processing is performed on an image after tilt correction.

The program according to claim 21, wherein face detection processing is performed on an image before tilt correction.

24. The program according to claim 22 or 23, wherein the face detection means can detect a face only when the image taken by the imaging apparatus is substantially in a predetermined direction. .

25. The program according to claim 24, further displaying the detected face photo area candidates, and when there are a plurality of candidates, the plurality of results are displayed on the display means to allow the user to select.

26. The program according to claim 25, wherein it is determined whether or not to start reading the description items of the identification card according to the size and position of the photo area.

23. The program according to claim 21, wherein the tilt correction unit performs a tilt correction according to a photograph area obtained by the photograph detection unit.

22. The program according to claim 21, wherein the outline of the identification card is displayed on the display means at the time of shooting, and the user is prompted to take a picture according to the outline.