JP2013097393A

JP2013097393A - System converting handwritten characters into text

Info

Publication number: JP2013097393A
Application number: JP2011236406A
Authority: JP
Inventors: Masahiro Miura; 雅弘三浦
Original assignee: O RID CO Ltd; O-RID CO Ltd
Current assignee: O RID CO Ltd; O-RID CO Ltd
Priority date: 2011-10-27
Filing date: 2011-10-27
Publication date: 2013-05-20

Abstract

PROBLEM TO BE SOLVED: To provide a system for converting handwritten characters into a text in which handwritten characters written in a note are read with high accuracy, and unauthorized use of the note can be prevented.SOLUTION: Processing to convert handwritten characters written in a note into text data can be performed with high accuracy by transmitting character image data photographed by a digital camera loaded on a terminal to a server using a communication line network, and converting the character image data into text data by a text data conversion processing part. Deletion of the handwritten characters and an alteration of a use frequency of a text data conversion processing service caused by copying of the note etc. can be prevented because the note is authenticated before the service is used using a two dimensional code added to the note.

Description

この発明は、手書き文字テキスト化システム、詳しくはノートに記入された手書き文字を、サーバのテキストデータ化処理部を利用してテキストデータ化する手書き文字テキスト化システムに関する。 The present invention relates to a handwritten character text conversion system, and more particularly to a handwritten character text conversion system that converts handwritten characters entered in a notebook into text data using a text data conversion processing unit of a server.

近年、手書きした内容を電子データ化する技術として、例えば、ノートに記入した手書き文字をスマートフォンのデジタルカメラにより撮影し、スマートフォンにインストールしたアプリケーションにより、文字画像データを画面に表示するとともにストレージに記憶させる株式会社キングジム製の「ショットノート」が開発されている（非特許文献１）。
非特許文献１では、紙面の四隅に印されたガイドマークを利用し、アプリケーションに搭載された歪補正部により文字画像データの歪みを補正するとともに、アプリケーションに搭載された簡易なＯＣＲ（ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）機能によって、ヘッダの日付と整理番号とのみを光学的に読み取ってテキストデータ化し、これらを文字画像データと関連付けてアプリケーションのストレージに記憶することで、ノートへの手書き内容の検索を容易としていた。 In recent years, as a technique for converting handwritten contents into electronic data, for example, handwritten characters entered in a notebook are photographed with a digital camera of a smartphone, and character image data is displayed on a screen and stored in a storage by an application installed on the smartphone. A “shot note” manufactured by King Jim Co., Ltd. has been developed (Non-Patent Document 1).
In Non-Patent Document 1, the guide marks marked at the four corners of the paper are used to correct the distortion of the character image data by the distortion correction unit mounted on the application, and a simple OCR (Optical Character Reader) mounted on the application. ) Function to optically read only the header date and reference number into text data and store them in the application storage in association with the character image data, making it easy to search for handwritten content in the notebook .

［平成２３年９月２５日検索］、インターネット〈ＵＲＬ：ｈｔｔｐ：／／ｗｗｗ．ｋｉｎｇｊｉｍ．ｃｏ．ｊｐ／ｓｐ／ｓｈｏｔｎｏｔｅ／〉[Search on September 25, 2011], Internet <URL: http: // www. kingjim. co. jp / sp / shotnote />

しかしながら、非特許文献１の「ショットノート」では、手書き文字を文字画像データのままで画面表示し、かつアプリケーションのストレージに記憶していた。そのため、例えばストレージに文字画像データとして記憶した議事録を、クライアントが電子メールを利用して関係者に文書で送信したい場合には、まずアプリケーションを起動し、テキストデータ化されたヘッダの日付や整理番号をインデックスとして、その議事録の文字画像データをスマートフォンのディスプレイに読み出し、それの内容をパソコンのワープロソフトにキーボードによって入力し、テキストデータ化する必要があった。
そこで、このようなクライアントに手入力を強いる煩わしさを解消するため、読み取り対象が数字のみという従来の簡易なＯＣＲではなく、手書き文字を光学的に読み取る高精度なＯＣＲを、アプリケーションに搭載することが考えられる。しかしながら、このように手書き文字も読み取れるＯＣＲでは、読み取り精度が十分とは言えなかった。 However, in the “shot note” of Non-Patent Document 1, handwritten characters are displayed on the screen as character image data and stored in the application storage. Therefore, for example, when the client wants to send the minutes stored as character image data in the storage to the related parties using e-mail as a document, the application is first started, and the date and arrangement of the header converted into text data are started. Using the number as an index, it was necessary to read the character image data of the minutes on the display of a smartphone, and input the contents into a word processing software on a personal computer using a keyboard to convert it into text data.
Therefore, in order to eliminate the annoyance of manually inputting such a client, the application is equipped with a high-accuracy OCR that optically reads handwritten characters, instead of the conventional simple OCR that only reads numbers. Can be considered. However, OCR that can read handwritten characters in this way cannot be said to have sufficient reading accuracy.

そこで、本発明者らは鋭意研究の結果、スマートフォンに搭載したデジタルカメラにより撮影した文字画像データを、通信回線網を利用してセンタのサーバに送信し、ここで文字画像データをテキストデータに変換すればよいことに想到した。すなわち、サーバにオペレータの端末から送信された読み取り結果としての（部分）テキストデータを利用する文字画像データのテキストデータ化処理部を設ければ、手書き文字（活字を含む）の高精度なテキストデータ化処理を行えることを知見するに至った。
しかしながら、サーバ側では、このテキストデータ化処理サービスをクライアントが利用する前に、クライアントが利用するノートの大きさや頁数などを把握しておく必要がある。その理由は、例えば手書き文字の消去やノートをコピーするといったテキストデータ化処理部の利用回数の不正を防止するためである。そこで、発明者は、ノートにＱＲコード（登録商標）などの２次元コードを印字し、これを利用して事前にノートのアクティベート（認証処理）をクライアント（ユーザ）に行わせ、その際、購入したノートのテキストデータ化利用回数の情報を取得するように構成すれば、上述した問題はすべて解消することを知見し、この発明を完成させた。 Therefore, as a result of diligent research, the present inventors transmit character image data captured by a digital camera mounted on a smartphone to a center server using a communication network, where the character image data is converted into text data. I thought of what I should do. That is, if a text data conversion processing unit for character image data that uses (partial) text data as a reading result transmitted from the operator's terminal is provided on the server, high-precision text data of handwritten characters (including printed characters) It came to know that it can be processed.
However, on the server side, before the client uses this text data conversion service, it is necessary to know the size of the notebook used by the client, the number of pages, and the like. The reason is to prevent illegal use of the text data processing unit such as erasing handwritten characters or copying notes. Therefore, the inventor prints a two-dimensional code such as a QR code (registered trademark) on a notebook and uses this to cause the client (user) to activate the notebook (authentication process) in advance. The present invention has been completed by discovering that all the above-mentioned problems can be solved if it is configured so as to obtain information on the number of times the notebook has been converted into text data.

この発明は、ノートに記入された手書き文字の高精度な読み取りが可能で、かつノートの不正利用も防止することができる手書き文字テキスト化システムを提供することを目的としている。 An object of the present invention is to provide a handwritten character text conversion system capable of reading a handwritten character written in a notebook with high accuracy and preventing unauthorized use of the notebook.

この発明は、文字の手書きスペースを有し、ノート認証用の２次元コードが表示された手書き用のノートと、前記手書きスペースに記入された手書き文字を撮影するデジタルカメラと、前記２次元コードを読み取る２次元コードリーダと、通信回線網を利用する端末側の通信部とを搭載した端末と、前記端末側の通信部よりそれぞれ送信された前記２次元コードリーダによる前記２次元コードの読み取りデータと、前記デジタルカメラによる前記手書き文字の文字画像データとを、前記通信回線網を利用して受信するサーバ側の通信部が搭載されたサーバと、前記サーバに設けられ、かつ前記２次元コードの読み取りデータを受信することで、前記ノートの利用を認証するとともに、該ノートによる前記テキストデータ化処理部の利用回数のデータを取得するノート認証部と、前記文字画像データを読み取ってテキストデータに変換するテキストデータ化処理部とを備えた手書き文字テキスト化システムである。 The present invention has a handwritten space for characters, a handwritten note on which a two-dimensional code for note authentication is displayed, a digital camera for photographing a handwritten character entered in the handwritten space, and the two-dimensional code. A terminal equipped with a two-dimensional code reader for reading, a terminal-side communication unit using a communication line network, and the two-dimensional code read data by the two-dimensional code reader respectively transmitted from the terminal-side communication unit; , A server on which a server-side communication unit that receives character image data of the handwritten characters by the digital camera using the communication network is mounted, and the server is provided with the two-dimensional code read By receiving data, the use of the note is authenticated, and the number of times the text data conversion processing unit uses the note A note authentication unit that acquires over data, a handwriting-to-text system comprising a text data processing unit for converting the text data by scanning the character image data.

ここでいう「ノート」とは、複数枚の紙（頁紙、用紙）を横綴じまたは縦綴じしたノートブック（帳面、冊子など）である。また、紙１枚だけのものでもよい。ノートサイズとしては、例えばＡ７，Ａ５，Ａ１が挙げられる。紙の手書きされる紙面（手書き面）は、例えば紙の表面である。
「手書き文字」とは、鉛筆、シャープペンシル、ボールペン、マジックペン、筆などの筆記具を用いて、人手により書かれた字（漢字、ひらがな、カタカナ、数字、アルファベットなど）で、印刷機により印刷された活字とは異なる。
「手書きスペース」とは、通常、紙の手書き面の全部または中央部のみであるが、例えば手書き面の上部のみ、下部のみまたは外周部のみでもよい。手書きスペースには、横罫や縦罫、マス目などを記入してもよい。 The “notebook” here is a notebook (book, booklet, etc.) in which a plurality of papers (page paper, paper) are bound or vertically bound. Further, only one sheet of paper may be used. Examples of the note size include A7, A5, and A1. The paper surface (handwritten surface) on which paper is handwritten is, for example, the surface of the paper.
“Handwritten characters” are characters (kanji, hiragana, katakana, numbers, alphabets, etc.) written by hand using writing instruments such as pencils, mechanical pencils, ballpoint pens, magic pens, and brushes. It is different from the type.
The “handwriting space” is usually all or only the center of the handwritten surface of the paper, but may be only the upper part, only the lower part or only the outer peripheral part of the handwritten surface, for example. You may enter a horizontal rule, a vertical rule, a grid, etc. in the handwriting space.

また、ノート紙面の撮影画像の歪みを、例えば端末のアプリケーションの歪補正機能により補正するようにしてもよい。補正法としては、台形補正法（射影変換）などを採用することができる。具体的には、デジタルカメラにより紙面を撮影し、その撮影画像の中から四隅のガイドマークを探し出し、発見した四隅のガイドマークを利用して、台形補正法によって、撮影画像を矩形状の整った紙面画像に変換する。これと同時に、端末のアプリケーションのノート種類特定機能により、四隅のガイドマークの形状や配列に基づき、読み取り対象のノートの種類（ノートサイズ、ノート等級）を特定する。 Further, the distortion of the photographed image on the notebook paper may be corrected by, for example, the distortion correction function of the application of the terminal. As a correction method, a trapezoidal correction method (projective transformation) or the like can be employed. Specifically, a digital camera is used to photograph the paper surface, find the four corner guide marks in the photographed image, and use the found four corner guide marks to make the photographed image rectangular by the trapezoidal correction method. Convert to paper image. At the same time, the note type specifying function of the terminal application specifies the type of note to be read (note size, note grade) based on the shape and arrangement of the guide marks at the four corners.

ここでいう「ノート認証」とは、アプリケーションのアクティベート機能により行われ、該当ノートに対して利用許可を与える処理である。
ここでいう「２次元コード（２次元バーコード）」とは、文字や数字などのデータを二次元の図形パターンとして紙面に刻印、印刷したもので、例えば、マトリックス型２次元コード（ＱＲコード）、スタック型２次元コードなどが挙げられる。
ノート上の２次元コードの表示位置は任意である。例えば、ノートの表紙の表面や裏面でも、各紙（頁紙）の紙面でも、表紙と１枚目の読み取り用紙との間に配置された認証専用紙の表面でもよい。この場合、認証専用紙の裏面にアクティベート方法を含むノートの利用方法の説明書きをしてもよい。特に、ノートの表紙をめくった１頁目（認証専用紙）に２次元コードを表示した方が、第三者が店内でノートを購入せずに携帯電話のデジタルカメラで撮影した後に、そのノートを購入したクライアントのアクティベートを妨害することを防止するとともに、利用しやすいという点から好ましい。 Here, “note authentication” is processing that is performed by an application activation function and that grants usage permission to the corresponding note.
The “two-dimensional code (two-dimensional bar code)” here is a data in which data such as characters and numbers is imprinted and printed on a paper surface as a two-dimensional graphic pattern, for example, a matrix type two-dimensional code (QR code). And a stack type two-dimensional code.
The display position of the two-dimensional code on the notebook is arbitrary. For example, it may be the front surface or back surface of the cover of a notebook, the surface of each sheet (page paper), or the surface of an authentication-dedicated sheet disposed between the cover and the first reading sheet. In this case, a description of how to use the notebook including the activation method may be provided on the back side of the authentication-dedicated paper. In particular, if a two-dimensional code is displayed on the first page (authentication-dedicated paper) that turns the cover of a notebook, the notebook will not be purchased by a third party in the store after taking a picture with a mobile phone digital camera. It is preferable from the viewpoint that it is easy to use while preventing activation of the client who purchased the client.

ここでいう端末とは、例えば個人用の携帯コンピュータの機能を併せ持った携帯電話（スマートフォン）の他、携帯コンピュータの機能を有さない携帯電話、パソコンなどを採用することができる。その他、クライアントが指定した別のパソコンでもよい。
デジタルカメラとしては、例えばＣＣＤカメラ、ＣＭＯＳカメラなどを採用することができる。
ここでいう「文字画像データ」とは、ノートに記入された手書き文字をデジタルカメラに撮影して得た手書き文字のイメージデータ（静止画像データ）である。例えば、ビットマップ・データ（ＢｉｔｍａｐＤａｔａ）などが挙げられる。 For example, a mobile phone (smart phone) that has the functions of a personal portable computer, a mobile phone that does not have a function of a mobile computer, a personal computer, and the like can be used as the terminal here. In addition, another personal computer designated by the client may be used.
As the digital camera, for example, a CCD camera or a CMOS camera can be employed.
The “character image data” here is handwritten character image data (still image data) obtained by photographing a handwritten character entered in a notebook with a digital camera. For example, bitmap data (Bitmap Data) can be used.

２次元コードリーダ（２次元バーコードリーダ）の種類は限定されない。
通信回線網としては、例えばインターネットなどを採用することができる。
端末側の通信部としては、無線伝送用の通信機器を採用することができる。また、サーバ側の通信部としては、有線伝送用の通信機器を採用することができる。一般的に、テキストデータはサーバ側のストレージに記憶し、必要時にクライアントが端末を使用して通信回線網からサーバにアクセスし、ストレージに記憶されたテキストデータを利用する。または、取得後のテキストデータをサーバから端末に返送するようにしてもよい。
テキストデータとは、レイアウト情報や修飾情報を持たない、純粋に文字のみで構成されるデータである。 The type of two-dimensional code reader (two-dimensional bar code reader) is not limited.
As the communication network, for example, the Internet can be employed.
As the terminal-side communication unit, a communication device for wireless transmission can be employed. In addition, a communication device for wired transmission can be employed as the server-side communication unit. In general, text data is stored in a storage on the server side, and when necessary, a client accesses the server from a communication network using a terminal and uses the text data stored in the storage. Alternatively, the acquired text data may be returned from the server to the terminal.
The text data is data composed purely of characters without layout information or decoration information.

ここでいう「テキストデータ化処理部」とは、例えば、２人以上のオペレータがノートの手書きスペースに記入された手書き文字を読み取り、その読み取り結果をオペレータ端末から送信（入力）することでテキストデータ化する処理部を採用することができる。その他、ノートのヘッダに手書きされた記入日（日付）や、フッタに記載された企業番号、整理番号だけでなく、手書きスペースの手書き文字を光学的に読み取ってテキストデータ化するＯＣＲでもよい。または、オペレータが介在する処理部とＯＣＲとの両方を併用したものでもよい。ＯＣＲ部を設けた場合には、テキストデータ化処理部に、ＯＣＲ部によるテキストデータ化が正しく行われたか否かを判定する２人以上のＯＣＲ判定オペレータのオペレータ端末から送信された判定結果が一致しているか否かを判定するＯＣＲ判定部を設けた方が望ましい。これにより、ＯＣＲ部による手書き文字の自動読み取りの精度が高まる。 Here, the “text data conversion processing unit” is, for example, text data obtained by two or more operators reading handwritten characters written in a handwritten space of a notebook and transmitting (inputting) the reading result from the operator terminal. A processing unit can be employed. In addition, not only the entry date (date) handwritten in the header of the notebook, the company number and the reference number written in the footer, but also OCR that optically reads handwritten characters in the handwritten space and converts them into text data may be used. Alternatively, a combination of both a processing unit with an operator and OCR may be used. When the OCR unit is provided, the determination result transmitted from the operator terminals of two or more OCR determination operators for determining whether or not the text data conversion by the OCR unit is correctly performed is stored in the text data conversion processing unit. It is desirable to provide an OCR determination unit that determines whether or not it is done. This increases the accuracy of automatic reading of handwritten characters by the OCR unit.

ここでいう「２次元コードの読み取りデータを受信することで、ノートの利用を認証する」とは、２次元コード中に図形パターン化されたノート情報（手書きスペース付きの紙面数など）を２次元コードリーダにより読み取り、対象ノートによるテキストデータ化処理部の利用を承認するものである。これに加えて、対象のノートを使用してテキストデータ化処理を何回利用できるかという情報を取得するものである。また、実際に発行しているコードかどうかも偽造防止のためチェックすることができる。
「ノート認証部」とは、アクティベーション（ライセンス認証）を行うソフトウエアである。 “Authenticating the use of a notebook by receiving the reading data of the two-dimensional code” here means that the notebook information (such as the number of papers with a handwritten space) that is patterned in the two-dimensional code is two-dimensional. It is read by a code reader and approves the use of the text data conversion processing unit by the target note. In addition to this, information on how many times the text data conversion processing can be used using the target note is acquired. It is also possible to check whether the code is actually issued to prevent forgery.
The “note authentication unit” is software that performs activation (license authentication).

また、この発明は、テキストデータ化処理部が、文字画像データを複数の部分文字画像データに分割する画像分割部と、１つの部分文字画像データを個別に読み取る２人以上の読み取りオペレータの各オペレータ端末から１つずつ送信された部分文字画像データの読み取り結果としての複数の部分テキストデータが一致しているか否かを判定する部分テキスト判定部と、部分テキスト判定部が一致すると判定した部分テキストデータを記憶する記憶部と、記憶部に記憶したテキストデータの全体分の部分テキストデータを、複数の部分文字画像データに文字画像データを分割した配列に則して繋げることで、テキストデータを作成するテキスト作成部とを有したものとし、テキストデータの作成後、サーバ側の通信部により、通信回線網を利用してテキストデータを端末に返送するようにした方が望ましい。 Further, according to the present invention, the text data conversion processing unit includes each of an image dividing unit that divides character image data into a plurality of partial character image data and two or more reading operators that individually read one partial character image data. Partial text determination unit for determining whether or not a plurality of partial text data as a result of reading partial character image data transmitted from the terminal one by one matches, and partial text data determined to match the partial text determination unit The text data is created by connecting the partial text data of the entire text data stored in the storage unit and the partial text data of the text data stored in the storage in accordance with an arrangement in which the character image data is divided into a plurality of partial character image data. A text creation unit, and after creating text data, the server side communication unit uses the communication line network. The person who was to return the text data to the terminal is desirable.

ここでいう「文字画像データを複数の部分文字画像データに分割する」とは、文字画像データを、所定の文字数ごと、所定の紙面領域（所定間隔、所定行数ごと）ごとに細分化することをいう。文字画像データの分割数、分割サイズは任意である。
「部分文字画像データ」とは、文字画像データを画像分割部により分割して得られたデータである。
「画像分割部」とは、この文字画像データの分割を行うソフトウエアである。
「１つの部分文字画像データを個別に読み取る２人以上の読み取りオペレータ」とは、文字画像データを分割して得た複数の部分文字画像データのうち、選出された１つの部分文字画像データについて、互いに協議することなく独自の判断で読み取りを行う２人または３人以上の読み取りオペレータをいう。 Here, “divide character image data into a plurality of partial character image data” means that character image data is subdivided by a predetermined number of characters and by a predetermined paper area (predetermined interval, every predetermined number of lines). Say. The number of divisions and the division size of character image data are arbitrary.
“Partial character image data” is data obtained by dividing character image data by an image dividing unit.
The “image dividing unit” is software for dividing the character image data.
“Two or more reading operators that individually read one partial character image data” refers to one selected partial character image data among a plurality of partial character image data obtained by dividing character image data. Two or more reading operators who read at their own discretion without consulting each other.

「各オペレータ端末から１つずつ送信された部分文字画像データの読み取り結果としての複数の部分テキストデータ」とは、２人以上の読み取りオペレータが個別に所有するオペレータ端末から部分テキスト判定部に送信（入力）された各オペレータの部分文字画像データの読み取り結果を意味する。各オペレータ端末からの読み取り結果は１台につき１つずつである。
「部分テキスト判定部」とは、各オペレータ端末から１つずつ送信された複数の部分テキストデータが一致（合致）または不一致かを判定するソフトウエアである。判定の対象となる部分テキストデータが３つ以上の場合の判定としては、例えばすべての部分テキストデータが一致するか否かという判定方法を採用することができる。この場合には、読み取りオペレータの人数が増えるほど、部分文字画像データの読み取り精度が高まる。その他、このすべての部分テキストデータが一致するか否かという判定方法の場合を除く、１組（２つ）以上の部分テキストデータが一致するか否かの判定としてもよい。 “A plurality of partial text data as a result of reading partial character image data transmitted one by one from each operator terminal” is transmitted from the operator terminals individually owned by two or more reading operators to the partial text determination unit ( It means the reading result of the input partial character image data of each operator. One reading result is obtained from each operator terminal.
The “partial text determination unit” is software that determines whether a plurality of partial text data transmitted one by one from each operator terminal match (match) or do not match. As a determination when there are three or more partial text data to be determined, for example, a determination method of whether or not all partial text data matches can be employed. In this case, the accuracy of reading partial character image data increases as the number of reading operators increases. In addition, it may be determined whether or not one set (two) or more partial text data matches except for the determination method of whether or not all the partial text data match.

「記憶部」としては、例えばサーバのホストコンピュータが有する各種のストレージなどを採用することができる。
ここでいう「記憶部に記憶したテキストデータの全体分の部分テキストデータ」とは、読み取り対象としての文字画像データの全体分量にあたる、すべての部分文字画像データについて個別に行われた部分テキスト判定部による一致判定（みなし一致判定を含む）で得られた部分テキストデータ（テキストデータを構成するために必要なすべての部分テキストデータ）を意味する。
ここでいう「テキストデータの全体分の部分テキストデータを、複数の部分文字画像データに文字画像データを分割した配列に則して繋げる」とは、複数の部分文字画像データに文字画像データを分割したときの配列順で、記憶部に記憶されたすべての部分テキストデータ、すなわち部分テキスト判定部による判定で一致（みなし一致判定を含む）しているとされたすべての部分テキストデータを繋げることで、テキストデータを作成することをいう。
「テキスト作成部」とは、サーバのホストコンピュータに設けられた前記すべての部分テキストデータからテキストデータを作成する（組み立てる）ソフトウエアである。 As the “storage unit”, for example, various storages possessed by the host computer of the server can be employed.
The “partial text data for the entire text data stored in the storage unit” here is a partial text determination unit that is individually performed for all partial character image data corresponding to the total amount of character image data to be read. Means partial text data (all partial text data necessary for composing text data) obtained by the match determination (including deemed match determination).
Here, “partial text data for the entire text data is connected in accordance with an array in which character image data is divided into a plurality of partial character image data” means that character image data is divided into a plurality of partial character image data. By connecting all the partial text data stored in the storage unit, that is, all the partial text data determined to match (including the deemed match determination) by the partial text determination unit in the arrangement order when To create text data.
The “text creation unit” is software that creates (assembles) text data from all the partial text data provided in the host computer of the server.

また、この発明は、部分テキスト判定部として、１人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータと、２人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータとが一致しているか否かを判定する第１の部分判定部と、第１の部分判定部の判定結果が不一致だった場合の３人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータと、１人目の読み取りオペレータのオペレータ端末から送信された部分テキストデータとが一致しているか否か、および、３人目の部分テキストデータと２人目の読み取りオペレータのオペレータ端末から送信された部分テキストデータとが一致しているか否かを判定する第２の部分判定部とを有したものを採用し、記憶部に記憶される部分テキストデータは、第１の部分判定部が一致していると判定した部分テキストデータか、第１の部分判定部が一致していないと判定した後の第２の部分判定部が一致していると判定した部分テキストデータとした方が望ましい。 Further, the present invention provides a partial text data as a partial text determination unit that is transmitted as a result of reading from the operator terminal of the first reading operator and a part as a result of reading that is transmitted from the operator terminal of the second reading operator. A reading result transmitted from the operator terminal of the third reading operator when the determination results of the first partial determination unit and the first partial determination unit that determine whether or not the text data match are the same Whether or not the partial text data matches the partial text data transmitted from the operator terminal of the first reading operator, and is transmitted from the operator terminal of the third reading operator It is determined whether or not the partial text data The partial text data stored in the storage unit is the partial text data determined by the first partial determination unit or the first part. It is desirable to use partial text data that has been determined that the second partial determination units after determining that the determination units do not match.

記憶部に記憶される部分テキストデータは、第１の部分判定部が一致していると判定した部分テキストデータである。または、第１の部分判定部が一致しないと判定した場合における第２の部分判定部が一致していると判定した部分テキストデータとなる。
さらに、第２の部分判定部による判定で一致するという結果が得られなかった場合には、４人目以降（５人目、６人目…、ｎ人目）の読み取りオペレータのオペレータ端末から送信された部分テキストデータと、それまでに取得された部分テキストデータとが一致しているかを、第３以降（第４、第５…、ｎ−１番目）の部分判定部により一致しているか否かを判定するようにしてもよい。また、４人目以降の読み取りオペレータは、それまでに取得された部分テキストデータを参照し、記憶部に記憶すべき部分テキストデータが存在すれば、取得済みの部分テキストデータの中から該当する部分テキストデータを選出し、この選出した部分テキストデータを４人目以降の読み取りオペレータの端末から、そのオペレータの読み取り結果の部分テキストデータとして送信するようにしてもよい。 The partial text data stored in the storage unit is partial text data determined by the first partial determination unit to match. Or it becomes the partial text data determined as the 2nd partial determination part in the case where it determines with a 1st partial determination part not matching.
Furthermore, if the result of the determination by the second partial determination unit does not give a match, the partial text transmitted from the operator terminal of the fourth or later (fifth, sixth,..., Nth) reading operator It is determined by the third and subsequent (fourth, fifth,..., N−1th) partial determination units whether the data and the partial text data acquired so far match. You may do it. Further, the fourth and subsequent reading operators refer to the partial text data acquired so far, and if there is partial text data to be stored in the storage unit, the corresponding partial text from the acquired partial text data. Data may be selected, and the selected partial text data may be transmitted as partial text data of the reading result of the operator from the terminal of the fourth or later reading operator.

請求項１に記載の発明によれば、ノートの２次元コードを２次元コードリーダによって読み取り、その後、通信回線網を利用し、２次元コードの読み取り結果であるノートの認証情報とノート記載文字のテキストデータ化利用回数情報とが、サーバのノート認証部に送信される。これにより、ノートのアクティベート（認証処理）が実施されるとともに、サーバ側に、そのノートのテキストデータ化利用回数情報も伝わる。その結果、クライアントがテキストデータ化処理サービスを要求する前に、ノートの規格（頁数など）を把握しておくことができる。よって、例えば手書き文字の消去やコピーノートを用いてのノートの不正使用、ひいてはテキストデータ化処理部による手書き文字のテキストデータ化処理サービスの不正利用を防止することができる。
次に、端末（例えばスマートフォン）のデジタルカメラにより撮影した文字画像データを、通信回線網を利用して文字読み取りセンタとしてのサーバに送信し、サーバが有したテキストデータ化処理部により、手書き文字をイメージ化した文字画像データに対するテキストデータ化処理を行う。これにより、手書き文字のテキストデータ化処理を高精度に行うことができる。 According to the first aspect of the present invention, the two-dimensional code of the notebook is read by the two-dimensional code reader, and thereafter, the authentication information of the notebook as a result of reading the two-dimensional code and the characters written in the notebook are read using the communication network. The text data use count information is transmitted to the note authentication unit of the server. Thereby, the activation (authentication process) of the note is performed, and the number of times of use of the note as text data is also transmitted to the server side. As a result, before the client requests the text data conversion processing service, it is possible to grasp the standard of the notebook (number of pages, etc.). Therefore, for example, it is possible to prevent erasure of handwritten characters, unauthorized use of notes using copy notes, and consequently unauthorized use of a text data conversion processing service for handwritten characters by the text data conversion processing unit.
Next, character image data captured by a digital camera of a terminal (for example, a smartphone) is transmitted to a server as a character reading center using a communication network, and handwritten characters are converted by a text data conversion processing unit included in the server. Text data conversion processing is performed on the imaged character image data. Thereby, the text data conversion processing of handwritten characters can be performed with high accuracy.

特に、請求項２に記載の発明によれば、画像分割部により文字画像データを複数の部分文字画像データに分割し、その後、部分テキスト判定部によって、それぞれの部分文字画像データに対して、個別に読み取る２人以上の読み取りオペレータの各オペレータ端末から１つずつ送信された複数の部分テキストデータが一致しているか否かを判定する。ここで、部分テキスト判定部が一致すると判定した部分テキストデータは記憶部に記憶する。同様の処理で、テキストデータの全体分の部分テキストデータが記憶部に記憶された後、テキスト作成部により、複数の部分文字画像データに文字画像データを分割した配列に則して、記憶部の中のすべての部分テキストデータを繋げて、テキストデータを作成する。これにより、ＯＣＲよりも高精度な手書き文字のテキストデータ化処理が可能となる。しかも、画像分割部により文字画像データを複数の部分文字画像データに分割することで、文字画像データの全体を各オペレータが目視することができない。これにより、情報漏洩を防止することができる。 In particular, according to the second aspect of the present invention, the character image data is divided into a plurality of partial character image data by the image dividing unit, and thereafter, each partial character image data is individually separated by the partial text determining unit. It is determined whether or not a plurality of pieces of partial text data transmitted one by one from each operator terminal of two or more reading operators that match each other match. Here, the partial text data determined to be matched by the partial text determination unit is stored in the storage unit. In the same process, after the partial text data for the entire text data is stored in the storage unit, the text creation unit stores the partial text data into a plurality of partial character image data in accordance with the arrangement obtained by dividing the character image data. Create text data by connecting all the partial text data. As a result, it becomes possible to convert the text data of handwritten characters with higher accuracy than OCR. In addition, by dividing the character image data into a plurality of partial character image data by the image dividing unit, each operator cannot visually see the entire character image data. Thereby, information leakage can be prevented.

請求項３に記載の発明によれば、部分テキスト判定部により判定する際には、まず１人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータと、２人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータとが一致しているか否かを第１の部分判定部により判定する。ここで、第１の部分判定部が一致していないと判定した場合には、３人目の読み取りオペレータのオペレータ端末から送信された読み取り結果としての部分テキストデータと、１人目の読み取りオペレータのオペレータ端末から送信された部分テキストデータとが一致しているか否か、および、３人目の部分テキストデータと２人目の読み取りオペレータのオペレータ端末から送信された部分テキストデータとが一致しているか否かを、第２の判定部により判定する。
このような複数ステップで行われる部分テキスト判定部による判定後、記憶部に記憶される部分テキストデータは、第１の部分判定部が一致していると判定した部分テキストデータか、第２の部分判定部が一致していると判定した部分テキストデータとなる。その結果、オペレータ端末の使用数を少なくして、請求項２の場合よりもさらに高精度な手書き文字のテキストデータ化処理が可能となる。 According to the third aspect of the present invention, when the partial text determination unit determines, the partial text data as the read result transmitted from the operator terminal of the first reading operator and the second reading operator The first partial determination unit determines whether or not the partial text data as a reading result transmitted from the operator terminal matches. Here, when it is determined that the first partial determination units do not match, the partial text data as the read result transmitted from the operator terminal of the third reading operator and the operator terminal of the first reading operator Whether or not the partial text data transmitted from the second partial text data and the partial text data transmitted from the operator terminal of the second reading operator match. The determination is made by the second determination unit.
After the determination by the partial text determination unit performed in such a plurality of steps, the partial text data stored in the storage unit is the partial text data determined by the first partial determination unit to match or the second part The partial text data determined to match the determination units. As a result, the number of operator terminals used can be reduced, and handwritten character text data can be processed with higher accuracy than in the case of claim 2.

この発明の実施例１に係る手書き文字テキスト化システムの全体構成を示す斜視図である。BRIEF DESCRIPTION OF THE DRAWINGS It is a perspective view which shows the whole structure of the handwritten character text conversion system which concerns on Example 1 of this invention. この発明の実施例１に係る手書き文字テキスト化システムの要部ブロック図である。It is a principal part block diagram of the handwritten character text conversion system which concerns on Example 1 of this invention. この発明の実施例１に係る手書き文字テキスト化システムのノート認証のフローシートである。It is a flow sheet of note authentication of the handwritten character text conversion system concerning Example 1 of this invention. この発明の実施例１に係る手書き文字テキスト化システムの手書き文字テキスト化処理の全体フローシートである。It is a whole flow sheet of handwritten character text conversion processing of the handwritten character text conversion system concerning Example 1 of this invention. この発明の実施例１に係る手書き文字テキスト化システムのテキストデータ化処理工程のフローシートである。It is a flow sheet of the text data conversion processing step of the handwritten character text conversion system according to Embodiment 1 of the present invention. この発明の実施例１に係る手書き文字テキスト化システムのノートの手書きスペースを有する紙面の拡大正面図である。It is an enlarged front view of the paper surface which has the handwriting space of the notebook of the handwritten character text conversion system which concerns on Example 1 of this invention. この発明の実施例１に係る手書き文字テキスト化システムのテキストデータ化処理の部分テキスト判定工程のフローシートである。It is a flowchart of the partial text determination process of the text data conversion process of the handwritten character text conversion system which concerns on Example 1 of this invention.

以下、この発明の実施例を具体的に説明する。 Examples of the present invention will be specifically described below.

図１および図２において、１０は実施例１に係る手書き文字テキスト化用システム（以下、テキスト化システム）で、このテキスト化システム１０は、各紙１１の外周部を除く部分に手書き文字の手書きスペースＳを有し、ノート認証用のＱＲコード（２次元コード）１４が表示された手書き用のノート１５と、手書きスペースＳに記入された手書き文字を撮影するＣＭＯＳカメラ（デジタルカメラ）１６、ＱＲコード１４を読み取る２次元コードリーダ１７、インターネット（通信回線網）１８を利用する端末側の通信部１９、および、テキストデータ化処理用のアプリケーション２０をそれぞれ搭載した端末（ここではスマートフォン）２１と、端末側の通信部１９よりそれぞれ送信された２次元コードリーダ１７によるＱＲコード１４の読み取りデータと、ＣＭＯＳカメラ１６による手書き文字の文字画像データとを、インターネット１８を利用して受信するサーバ２２と、インターネット１８を利用するサーバ側の通信部２３と、サーバ２２に設けられ、文字画像データを読み取ってテキストデータに変換するテキストデータ化処理部２４と、サーバ２２に設けられ、かつＱＲコード１４の読み取りデータを受信することで、ノート１５の利用を認証するとともに、ノート１５によるテキストデータ化処理部２４の利用回数のデータを取得するノート認証部２５とを備えている。 In FIG. 1 and FIG. 2, reference numeral 10 denotes a handwritten character text conversion system (hereinafter referred to as a text conversion system) according to the first embodiment. S, a handwritten note 15 on which a QR code (two-dimensional code) 14 for note authentication is displayed, a CMOS camera (digital camera) 16 that captures handwritten characters entered in the handwriting space S, and a QR code A two-dimensional code reader 17 that reads 14, a terminal-side communication unit 19 that uses the Internet (communication network) 18, and a terminal (here, a smartphone) 21 that includes an application 20 for text data conversion processing, and a terminal QR code 14 by the two-dimensional code reader 17 respectively transmitted from the communication unit 19 on the side A server 22 that receives read data and character image data of handwritten characters by the CMOS camera 16 using the Internet 18, a server-side communication unit 23 that uses the Internet 18, and the server 22 are provided in the character image. The text data conversion processing unit 24 that reads data and converts it into text data, and is provided in the server 22 and receives the read data of the QR code 14, thereby authenticating the use of the notebook 15 and the text data generated by the notebook 15. And a note authenticating unit 25 that acquires data on the number of times the data processing unit 24 is used.

図１および図６に示すように、ノート１５は、端末２１と同一サイズ（横６０ｍｍ、縦１１５ｍｍ）の２９枚（説明書付きの２０枚〜５０枚でもよい）の紙１１を、縦綴りし、これを厚地の表紙１２と裏表紙１３との間に挟み込んだ冊子である。よって、テキストデータ化処理サービスの最大利用回数は２９回である。なお、表紙１２と１枚目の紙１１との間には、表面に２次元コードが表示され、かつ裏面にノート認証を含むノートの使用法が記載された認証専用紙１２Ａが差し入れられている。また、表紙１２の表面の中央部には、図示しない提携企業の社名とロゴとが印刷されている。 As shown in FIG. 1 and FIG. 6, the notebook 15 has 29 sheets (20 to 50 sheets with instructions) of the same size as the terminal 21 (60 mm in width and 115 mm in length) that are bound vertically. This is a booklet in which this is sandwiched between a thick cover 12 and a back cover 13. Therefore, the maximum use count of the text data conversion processing service is 29 times. In addition, between the cover 12 and the first sheet 11, an authentication-dedicated paper 12A in which a two-dimensional code is displayed on the front surface and usage of the notebook including note authentication is described on the back surface is inserted. . In addition, a company name and a logo of a partner company (not shown) are printed at the center of the surface of the cover 12.

各紙１１の表面の上端部には、ヘッダに当たる記入日（記入した年月日）２６を手書きするため、８桁の矩形状の小枠が印刷されている。また、各紙１１の表面の下端部には、フッタに該当する６桁の数字（英字でもよい）のフッタコード（分類コード）２７が印字されている。フッタコード２７は、左から３ケタまでが広告（企業）コード（例えば０００；広告なし、００１；第１の自社広告、００２；第２の自社広告など）、左から４桁目が機能コード（例えば０；機能なし、１；カレンダー、２；タグ、カレンダーなど）、左から５桁目が文字画像データの分割方法のコード（例えば０；現在の固定座標分割、１；マニュアル分割など）、左から６桁目（右端）が検査数字であるチェックデジットのコードである。ここでのチェックデジットの計算方法は、広告コードの各桁の数字をすべて足し、機能コードの数字を２倍し、分割コードの数字を３倍し、これらを合算して得た総数を１０で除算した答えの余りをチェックデジットとする。これらは、サーバ２２の記憶部３３に記憶される。 An 8-digit rectangular small frame is printed on the upper end of the surface of each paper 11 in order to handwritten the entry date (date entered) 26 corresponding to the header. Further, a footer code (classification code) 27 of a 6-digit number (which may be an alphabetic character) corresponding to a footer is printed on the lower end portion of the surface of each paper 11. The footer code 27 includes an advertisement (company) code from the left to the third digit (for example, 000; no advertisement, 001: first house advertisement, 002; second house advertisement, etc.), and the fourth digit from the left is a function code ( For example, 0; no function, 1; calendar, 2; tag, calendar, etc.) The fifth digit from the left is the code of the character image data division method (for example, 0; current fixed coordinate division, 1; manual division, etc.), left The sixth digit (right end) is a check digit code which is a check digit. The check digit is calculated by adding all the digits of the advertisement code, doubling the number of the function code, multiplying the number of the divided code by 3, and adding them up to the total number of 10 The remainder of the divided answer is used as a check digit. These are stored in the storage unit 33 of the server 22.

手書きの記入日２６は、後述するＯＣＲ部２８によって自動的に読み取られ、また活字のフッタコード２７は端末側のＯＣＲ部（認識できない場合はオペレータによる読み取り）５２によって自動的に読み取られる。なお、ヘッダに手書きされた記入日２６とフッタコード２７の活字とを各判定部により読み取る際には、これらの領域に記入された各文字を、同時に読み取るようにしてもよい。さらに、各紙１１の表面の四隅には、歪補正用およびノート情報取得用のガイドマーク（分類マーク）２９が印刷されている。この四隅のガイドマーク２９をカメラ撮影することで、後述するアプリケーション２０に搭載された歪補正部５３により、カメラ撮影時の文字画像データの歪みが補正される。しかも、アプリケーション２０を使用して、ガイドマーク２９の形状、四隅での配列などからノートサイズやノート等級（テキストデータ化提供速度・品質を変えてもよい）などの情報を読み取る。 The handwritten entry date 26 is automatically read by an OCR unit 28 (to be described later), and the type footer code 27 is automatically read by an OCR unit (read by an operator if it cannot be recognized) 52 on the terminal side. When the determination date is used to read the entry date 26 handwritten in the header and the type of footer code 27, the characters entered in these areas may be read simultaneously. Further, guide marks (classification marks) 29 for distortion correction and note information acquisition are printed at the four corners of the surface of each paper 11. When the guide marks 29 at the four corners are photographed by the camera, the distortion of the character image data at the time of camera photographing is corrected by the distortion correcting unit 53 installed in the application 20 described later. In addition, the application 20 is used to read information such as the note size and note grade (text data provision speed and quality may be changed) from the shape of the guide mark 29 and the arrangement at the four corners.

ガイドマーク２９は、表１に示すように、ヘッダ（左隅）用の６種類（横並び）と、フッタ（左隅）用の６種類（縦並び）とからなる。ヘッダ用の６種類のものとしては、１；上下に近接した正方形が２個の組み合わせ、２；１個の正方形と１個の縦長な長方形が上下に近接した組み合わせ、３；２個の正方形が上下に離間した組み合わせ、４；２個の若干横長な長方形が左右に近接した組み合わせ、５；１個の正方形と１個の横長な長方形とが左右に近接した組み合わせ、６；２個の正方形が左右に離間した組み合わせが挙げられる。また、フッタ用の６種類としては、１；２個の正方形が上下に近接した組み合わせ、２；１個の縦長な長方形と１個の正方形とが上下に近接した組み合わせ、３；２個の正方形が上下に離間した組み合わせ、４；２個の若干横長な正方形が左右に近接した組み合わせ、５；１個の正方形と１個の横長な長方形とが左右に近接した組み合わせ、６；２個の正方形が左右に離間した組み合わせが挙げられる。表１のデータは、サーバ２２の記憶部３３にあらかじめ記憶されている。 As shown in Table 1, the guide mark 29 includes six types (horizontal alignment) for the header (left corner) and six types (vertical alignment) for the footer (left corner). The six types of headers are: 1; a combination of two squares close to each other up and down; 2; a combination of one square and one long rectangle close to each other up and down; 3; two squares 4; a combination of two slightly oblong rectangles close to the left and right; 5; a combination of one square and one oblong rectangle close to the left and right; 6; two squares Combinations separated from each other can be mentioned. In addition, the six types for the footer are: 1; a combination in which two squares are close to each other up and down; 2; a combination in which one vertical rectangle and one square are close to each other in a vertical direction; 3; two squares 4; a combination of two slightly horizontal squares close to the left and right 5; a combination of one square and one horizontal rectangle close to the left and right 6; two squares Is a combination separated from side to side. The data in Table 1 is stored in advance in the storage unit 33 of the server 22.

また、各紙１１の表面の外周部を除く部分には、手書きスペースＳが形成されている。具体的な手書きスペースＳの形成位置は、各紙１１の表面の記入日２６とフッタコード２７との間の領域である。 Further, a handwritten space S is formed in a portion excluding the outer peripheral portion of the surface of each paper 11. A specific formation position of the handwriting space S is an area between the entry date 26 and the footer code 27 on the surface of each paper 11.

このような手書きスペースＳは、最上段のタイトル記入欄３０と、それより下方の領域を、合計８本の横罫（実線）ａによって上下均等に区分した６つの部分文字画像データ表示欄３１とにより構成されている。なお、横罫ａはノートサイズにより異なる。また、カレンダ連携用時刻記載欄、タグ記載欄などを設けてもよい。各部分文字画像データ表示欄３１は、幅方向の中間部にそれぞれ平行に配置された破線の横罫ｂによって、上下１行ずつの２行書きとなっている。手書き文字は、それぞれの部分文字画像データ表示欄３１に上下２行で記入することが正規であるものの、破線の横罫ｂを跨ぐような大きな字で、１つの部分文字画像データ表示欄３１に１行書きしてもよい。
また、ノート１５の認証専用紙１２Ａの表面の中央部には、ノート認証用の２次元コード１４が１つ印刷されている。サーバ２２側は、この２次元コード１４を２次元コードリーダ１７によって読み取ることで、クライアントに対して事前にノート１５のアクティベート（認証処理）を行わせる。しかも、購入したノート１５のテキストデータ化利用回数の情報を取得する。 Such a handwriting space S includes an uppermost title entry field 30 and six partial character image data display fields 31 in which the area below the uppermost part is equally divided by a total of eight horizontal rules (solid lines) a. It is comprised by. The horizontal rule a differs depending on the note size. Further, a calendar cooperation time description column, a tag description column, and the like may be provided. Each partial character image data display column 31 is written in two lines, one line above and below, with a horizontal ruled line b of a broken line arranged in parallel in the middle part in the width direction. Although it is normal for handwritten characters to be entered in two upper and lower lines in each partial character image data display column 31, the handwritten characters are large characters that straddle the broken horizontal ruled line b and are displayed in one partial character image data display column 31. One line may be written.
Also, one two-dimensional code 14 for note authentication is printed at the center of the surface of the authentication-dedicated paper 12A of the note 15. On the server 22 side, the two-dimensional code 14 is read by the two-dimensional code reader 17 to cause the client to activate the note 15 (authentication process) in advance. In addition, information on the number of times the purchased notebook 15 is converted into text data is acquired.

次に、図１および図２を参照して、クライアントの端末２１を詳細に説明する。
この端末２１は、携帯電話の一種であるスマートフォンで、ＣＰＵ５０に対して、インターネット接続用の端末側の通信部１９と、ネットストアから端末２１にインストールしたテキストデータ化処理専用のアプリケーション２０と、裏面の上部に埋め込まれたデジタルカメラの一種であるＣＭＯＳカメラ１６と、２次元コード１４を読み取る２次元コードリーダ１７と、記憶部３２とがそれぞれ電気的に接続されている。アプリケーション２０には、ＣＭＯＳカメラ１６により撮影されたノート紙面の撮影画像の歪を補正するための歪補正部５３と、フッターコード２７を読み取るＯＣＲ部５２とが搭載されている。歪補正部５３は、手書きスペースＳを有する各紙１１の紙面の四隅に表示された前記複数のガイドマーク２９を利用し、紙面の撮影画像の歪みを台形補正法（射影変換）により補正するソフトウエアである。ＯＣＲ部５２は、印刷された数字が読み取れる程度の簡易なものである。 Next, the client terminal 21 will be described in detail with reference to FIG. 1 and FIG.
The terminal 21 is a smartphone that is a type of mobile phone, and is connected to the CPU 50 with respect to the CPU 50, the communication unit 19 on the terminal side, the application 20 dedicated to text data conversion installed on the terminal 21 from the net store, and the back surface. A CMOS camera 16, which is a kind of digital camera embedded in the upper part, a two-dimensional code reader 17 that reads the two-dimensional code 14, and a storage unit 32 are electrically connected to each other. The application 20 includes a distortion correction unit 53 for correcting distortion of a photographed image of the note paper imaged by the CMOS camera 16 and an OCR unit 52 that reads the footer code 27. The distortion correction unit 53 uses the plurality of guide marks 29 displayed at the four corners of the sheet 11 of each sheet 11 having the handwriting space S, and corrects distortion of the captured image on the sheet by a trapezoidal correction method (projection conversion). It is. The OCR unit 52 is simple enough to read printed numbers.

次いで、図２および図７を参照して、サーバ２２の構成を詳細に説明する。
サーバ２２は、ＣＰＵ５１に対して、サーバ用の通信部２３と、手書きの文字画像データをテキストデータに変換するテキストデータ化処理部２４と、クライアントが購入したノート１５の認証を行うノート認証部２５とが電気的に接続されたものである。 Next, the configuration of the server 22 will be described in detail with reference to FIGS. 2 and 7.
The server 22 has a server communication unit 23, a text data conversion processing unit 24 that converts handwritten character image data into text data, and a note authentication unit 25 that authenticates the notebook 15 purchased by the client. Are electrically connected.

このうち、図２に示すように、テキストデータ化処理部２４は、端末２１からサーバ２２にアップロードされた文字画像データを、タイトルを含めて７つの部分文字画像データに分割する画像分割部３４と、ヘッダの記入日２６および手書きの本文を光学的に読み取るＯＣＲ部２８と、サーバ２２のＯＣＲ部２８と端末２１のＯＣＲ部５２のテキストデータ化が正しく行われたか否かを判定する２人のＯＣＲ判定オペレータα、βの各オペレータ端末２１α、２１βから１つずつ送信された判定結果としてのサーバ２２のＯＣＲ部２８と端末２１のＯＣＲ部５２とのテキストデータが一致しているか否かを判定するＯＣＲ判定部３５と、１つの部分文字画像データを個別に読み取る２人以上の読み取りオペレータの各オペレータ端末２１Ａ〜２１Ｅ（図７）から１つずつ送信された部分文字画像データの読み取り結果としての複数の部分テキストデータが一致しているか否かを判定する部分テキスト判定部４０と、この部分テキスト判定部４０が一致すると判定した部分テキストデータとを記憶する記憶部３３と、記憶部３３に記憶したテキストデータの全体分の部分テキストデータを、合計７つの部分文字画像データに文字画像データを分割した配列に則して繋げることで、テキストデータを作成するテキスト作成部４１とを有している。作成されたテキストデータは、サーバ２２の記憶部３３に記憶され、必要時にクライアントが端末２１やパソコンなどからサーバ２２にアクセスし、テキストデータを取得する。このとき、ＯＣＲ判定オペレータまたは読み取りオペレータは、通常勤務のオペレータではなく、自宅にいながら都合の良い時間に自己の端末を利用して、サーバ２２にアクセスして作業を行ってもよい。部分文字画像データの数は７つに限定されない（例えば１１でもよい）。 Among these, as shown in FIG. 2, the text data conversion processing unit 24 includes an image dividing unit 34 that divides the character image data uploaded from the terminal 21 to the server 22 into seven partial character image data including the title. Two persons who judge whether or not the text data conversion of the OCR unit 28 that optically reads the header entry date 26 and the handwritten text, the OCR unit 28 of the server 22 and the OCR unit 52 of the terminal 21 has been performed correctly. It is determined whether or not the text data of the OCR unit 28 of the server 22 and the OCR unit 52 of the terminal 21 match each other as a determination result transmitted from the operator terminals 21α and 21β of the OCR determination operators α and β. Each of the operator terminals 21A-2 of the two or more reading operators that individually read one partial character image data. A partial text determination unit 40 for determining whether or not a plurality of partial text data as a result of reading partial character image data transmitted one by one from E (FIG. 7) match, and this partial text determination unit 40 The storage unit 33 that stores the partial text data determined to match, and the partial text data for the entire text data stored in the storage unit 33 are arranged in accordance with an arrangement in which character image data is divided into a total of seven partial character image data. Thus, a text creation unit 41 that creates text data is provided. The created text data is stored in the storage unit 33 of the server 22, and when necessary, the client accesses the server 22 from the terminal 21 or a personal computer to acquire the text data. At this time, the OCR determination operator or the reading operator may perform work by accessing the server 22 using his / her terminal at a convenient time while staying at home, instead of being an operator at normal work. The number of partial character image data is not limited to seven (for example, 11 may be sufficient).

ここで、図７のフローシートを参照して、データエントリが行われる部分テキスト判定部の具体的な構成を説明する。
部分テキスト判定部４０は、１人目の読み取りオペレータＡのオペレータ端末２１Ａから送信された読み取り結果としての部分テキストデータと、２人目の読み取りオペレータＢのオペレータ端末２１Ｂから送信された読み取り結果としての部分テキストデータとが一致しているか否かを判定する第１の部分判定部３６と、第１の部分判定部３６の判定結果が不一致だった場合の３人目の読み取りオペレータＣのオペレータ端末２１Ｃから送信された読み取り結果としての部分テキストデータと、１人目の読み取りオペレータＡのオペレータ端末２１Ａから送信された部分テキストデータとが一致しているか否か、および、３人目の部分テキストデータと２人目の読み取りオペレータＢのオペレータ端末２１Ｂから送信された部分テキストデータとが一致しているか否かを判定する第２の判定部３７を有し、第２の部分判定部３７による判定で何れも一致していないという判定結果が得られた場合には、４人目の読み取りオペレータＤのオペレータ端末２１Ｄから、サーバ２２の記憶部３３に送信された読み取り結果としての部分テキストデータを、部分テキスト判定部４０における一致した判定結果とみなすものである。 Here, a specific configuration of the partial text determination unit in which data entry is performed will be described with reference to the flowchart of FIG.
The partial text determination unit 40 includes partial text data as a reading result transmitted from the operator terminal 21A of the first reading operator A and partial text as a reading result transmitted from the operator terminal 21B of the second reading operator B. Transmitted from the operator terminal 21C of the third reading operator C when the determination results of the first partial determination unit 36 and the first partial determination unit 36 that determine whether or not the data match are the same. Whether or not the partial text data as the read result matches the partial text data transmitted from the operator terminal 21A of the first reading operator A, and the third partial text data and the second reading operator Partial text data transmitted from the operator terminal 21B of B If the determination result by the second partial determination unit 37 indicates that they are not consistent with each other, the result is 4 The partial text data as a reading result transmitted from the operator terminal 21D of the human reading operator D to the storage unit 33 of the server 22 is regarded as a matching determination result in the partial text determination unit 40.

したがって、記憶部３３に記憶される部分テキストデータは、第１の部分判定部３６が一致していると判定した部分テキストデータか、第１の部分判定部３６が一致していないと判定した後に、第２の部分判定部３７が一致していると判定した部分テキストデータか、第２の部分判定部３７が一致していないと判定した後に４人目の読み取りオペレータＤのオペレータ端末２１Ｄからサーバ２２の記憶部３３に送信された部分テキストデータの何れかである。つまり、読み取りオペレータＤは、読み取りオペレータＡ、Ｂ、Ｃの読み取り結果を閲覧し、自己の判断で読み取り結果をオペレータ端末２１Ｄから記憶部３３に送信可能である。 Therefore, the partial text data stored in the storage unit 33 is the partial text data determined by the first partial determination unit 36 to match or after the first partial determination unit 36 determines that they do not match. From the operator terminal 21D of the fourth reading operator D after determining that the second partial determination unit 37 matches the partial text data determined to match or the second partial determination unit 37 does not match, the server 22 Any of the partial text data transmitted to the storage unit 33. That is, the reading operator D can browse the reading results of the reading operators A, B, and C, and can transmit the reading results from the operator terminal 21D to the storage unit 33 at his / her own judgment.

次に、図１〜図７を参照して、この発明の実施例１に係るテキスト化システム１０によるクライアントがノート１５に記入した手書き文字のテキストデータ化処理方法を説明する。
まず、クライアントは、スマートフォンである端末２１をインターネット１８に接続し、ネットストアからテキストデータ化処理専用のアプリケーション２０をインストールするとともに、ネットショップなどからノート１５を購入する。
次いで、クライアントは、テキスト化システム１０によるテキストデータ化処理サービスを利用する前に、図３のフローシートに示すように、ノート１５の表紙１２を捲り、認証専用紙１２Ａの２次元コード１４を、２次元（ＱＲ）コードリーダ１７により読み取ってコード情報を得る。その後、端末側の通信部１９からインターネット１８を利用し、２次元コード１４の読み取り結果としてのテキストデータ化利用回数情報（利用回数２９回）を含むノート１５の認証情報をサーバ２２に送信し、これらをサーバ側の通信部２３が受信してノート認証部２５に送信する。その後、ノート認証部２５によるノート１５の認証が行われ、認証終了後、ノート認証部２５から端末２１へ認証完了通知が返信される。これにより、ノート１５のアクティベート（認証処理）が実施されるとともに、サーバ２２にノート１５のテキストデータ化利用回数情報が伝達される。その結果、例えば手書き文字の消去やコピーノートを用いてのノート１５の不正使用、ひいてはこのテキストデータ化処理サービスの不正利用を防止することができる。 Next, with reference to FIGS. 1 to 7, a text data conversion processing method for handwritten characters entered in the note 15 by the client by the text conversion system 10 according to the first embodiment of the present invention will be described.
First, the client connects the terminal 21, which is a smartphone, to the Internet 18, installs an application 20 dedicated to text data conversion from a net store, and purchases a notebook 15 from a net shop or the like.
Next, before using the text data conversion processing service by the text conversion system 10, the client rolls the cover 12 of the notebook 15 and the two-dimensional code 14 of the authentication dedicated paper 12A as shown in the flow sheet of FIG. The code information is obtained by reading with a two-dimensional (QR) code reader 17. Then, using the Internet 18 from the communication unit 19 on the terminal side, the authentication information of the note 15 including the text data conversion usage number information (usage number 29 times) as a result of reading the two-dimensional code 14 is transmitted to the server 22. These are received by the server side communication unit 23 and transmitted to the note authentication unit 25. Thereafter, the note authentication unit 25 authenticates the note 15, and after the authentication is completed, an authentication completion notification is returned from the note authentication unit 25 to the terminal 21. As a result, the activation (authentication process) of the note 15 is performed, and the text data use count information of the note 15 is transmitted to the server 22. As a result, for example, it is possible to prevent erasure of handwritten characters, unauthorized use of the notebook 15 using a copy note, and consequently unauthorized use of this text data conversion processing service.

次に、図４、図５および図７の各フローシートおよび図１、図６を参照して、テキスト化システム１０を用いて行われる、ノート１５に記入した手書き文字のテキストデータ化処理方法を説明する。
まず、図１および図６に示すように、ノート１５の認証専用紙１２Ａをめくり、紙１１の表面のヘッダの８つの小枠内に記入日２６を示す、例えば「２０１１−０８−０６」を、ペンを使って手書きする。ただし、西暦の「２０」はすでに印刷され、フッタコード２７にもあらかじめ６桁「５５５１６５」が印刷されている。その後、手書きスペースＳに、ペンを使用して日誌を手書きする。記入にあたっては、タイトル記入欄３０にタイトルの「ＭＶｐｅｎを購入」を手書きするとともに、上から３段までの部分文字画像データ表示欄３１に、１段２行で日誌を手書きする。具体的には、最上段の部分文字画像データ表示欄３１には「スマートにアナログメモ→ｉｐｈｏｎｅ→ＫＹＢＥＲ」を上下２段に手書きし、上から２段目の部分文字画像データ表示欄３１には「又はＥｖｅｒｎｏｔｅで、グラウド上に保存でき」を上下２段に手書きするとともに、上から３段目の部分文字画像データ表示欄３１の上１段のみに「ないか奮闘中…」をそれぞれ手書きする。 Next, referring to the respective flow sheets of FIGS. 4, 5 and 7 and FIGS. 1 and 6, a text data conversion processing method for handwritten characters entered in a note 15 performed using the text conversion system 10 will be described. explain.
First, as shown in FIGS. 1 and 6, the authentication-dedicated paper 12A of the notebook 15 is turned, and the entry date 26 is shown in the eight small frames of the header on the surface of the paper 11, for example, “2011-08-06”. Handwriting with a pen. However, “20” in the Christian era is already printed, and the 6-digit “555165” is also printed in the footer code 27 in advance. Thereafter, the diary is handwritten in the handwriting space S using a pen. In the entry, the title “Purchase MV pen” is handwritten in the title entry field 30 and the diary is handwritten in two lines per line in the partial character image data display field 31 from the top three. Specifically, “smart analog memo → iphone → KYBER” is handwritten in the upper and lower two rows in the uppermost partial character image data display column 31, and the partial character image data display column 31 in the second row from the top. Handwritten “Or Evernote can be saved on the ground” in the upper and lower two rows, and handwritten “I am struggling ...” only in the upper row of the partial character image data display column 31 in the third row from the top. .

その後、図４のフローシートに示すように、端末（スマートフォン）２１のアプリケーション２０を作動した状態で、ＣＭＯＳカメラ１６により、手書きされた紙１１の表面全体を撮影する。撮影された文字画像データは、紙１１の表面の四隅に印刷されたガイドマーク２９を利用し、アプリケーション２０に搭載された歪補正部（歪補正機能）５３により、カメラ１６撮影時の歪みが補正される。すなわち、歪補正部５３は、文字画像データの中から、手書きスペースＳを有する各紙１１の紙面の四隅に表示されたガイドマーク２９を探し出し、これらを利用し、紙面の撮影画像の歪みを周知の台形補正法（射影変換）により補正する。また、アプリケーション２０に搭載されたＯＣＲ部５２により、フッターコード２７を読み取る。 Thereafter, as shown in the flow sheet of FIG. 4, the entire surface of the handwritten paper 11 is photographed by the CMOS camera 16 with the application 20 of the terminal (smartphone) 21 activated. The photographed character image data uses guide marks 29 printed at the four corners of the surface of the paper 11, and a distortion correction unit (distortion correction function) 53 installed in the application 20 corrects distortion at the time of shooting with the camera 16. Is done. That is, the distortion correction unit 53 searches the character image data for the guide marks 29 displayed at the four corners of the paper 11 having the handwriting space S, and uses these to make known the distortion of the photographed image on the paper. Correct by the keystone correction method (projective transformation). Further, the footer code 27 is read by the OCR unit 52 mounted on the application 20.

補正後の文字画像データ、アプリケーション２０により処理された四隅のガイドマーク２９の読み取りデータおよびフッターコード２７の読み取りデータは、端末側の送信部１９からインターネット１８を利用して文字テキスト化センタとしてのサーバ２２に送信され、サーバ側の通信部２３がこれを受信してテキストデータ化処理部２４に送信する。ここでは、まずテキストデータ化処理サービスの利用回数が規定回数（２９回）以内か否かが判断され、規定回数以内である場合には、次のテキストデータ化処理工程に移行する。一方、規定回数を上回っている場合には、規定回数超えの通知を、サーバ側の通信部２３とインターネット１８と端末側の通信部１９とを順次経て、端末２１のアプリケーション２０に送信され、指定回数超えであることが、端末２１の液晶画面に表示される。
次に、テキストデータ化処理部２４によって文字画像データがテキストデータに変換される。得られたテキストデータは、サーバ２２の記憶部３３に記憶される。こうすることで、クライアントの端末２１からアクセスがあったときに、このサーバ２２の記憶部３３からテキストデータを取り出すことができる。なお、テキストデータを、サーバ側の通信部２３から端末側の通信部１９に送信し、端末２１の液晶画面に表示するとともに、端末２１の記憶部（メモリ）３２に記憶するようにしてもよい。 The corrected character image data, the read data of the guide marks 29 at the four corners processed by the application 20 and the read data of the footer code 27 are transmitted from the terminal side transmission unit 19 to the server as a character text conversion center using the Internet 18. 22, the server-side communication unit 23 receives this and transmits it to the text data conversion processing unit 24. Here, it is first determined whether or not the number of times the text data conversion processing service is used is within the prescribed number (29 times). If the number is within the prescribed number, the process proceeds to the next text data conversion processing step. On the other hand, if the specified number of times is exceeded, a notification of exceeding the specified number of times is sent to the application 20 of the terminal 21 via the server-side communication unit 23, the Internet 18, and the terminal-side communication unit 19 in order. That the number of times has been exceeded is displayed on the liquid crystal screen of the terminal 21.
Next, the text data conversion processing unit 24 converts the character image data into text data. The obtained text data is stored in the storage unit 33 of the server 22. By doing so, the text data can be extracted from the storage unit 33 of the server 22 when accessed from the client terminal 21. The text data may be transmitted from the communication unit 23 on the server side to the communication unit 19 on the terminal side, displayed on the liquid crystal screen of the terminal 21, and stored in the storage unit (memory) 32 of the terminal 21. .

ここで、図５および図７のフローシートを参照して、テキストデータ化処理部２４によるテキストデータ化処理工程を詳細に説明する。
テキストデータ化処理部２４では、まず画像分割部３４により文字画像データが、ヘッダの部分と、フッタ部分と、タイトル部分と、手書き文字の本文部分とに分割される。このうち、ヘッダ部分に手書きされた記入日２６と本文部分とはＯＣＲ部２８により光学的に読み取られてテキストデータ化される。フッタコード２７は、あらかじめ端末側のＯＣＲ部５２により読み取られているので、ＯＣＲ判定部３５による判定結果に基づき、インターネット１８を通して端末２１の液晶画面に提携先企業のＵＲＬに結びついた各提携先企業の社名やロゴなどを表示させる。これにより、表紙１２の表面のアナログの宣伝だけでなく、デジタルの宣伝効果も同時に得られる。 Here, the text data conversion processing step by the text data conversion processing unit 24 will be described in detail with reference to the flow sheets of FIGS.
In the text data conversion processing unit 24, first, the image dividing unit 34 divides the character image data into a header portion, a footer portion, a title portion, and a body portion of handwritten characters. Among these, the entry date 26 and the body part handwritten in the header part are optically read by the OCR unit 28 and converted into text data. Since the footer code 27 is read in advance by the OCR unit 52 on the terminal side, each partner company linked to the URL of the partner company on the liquid crystal screen of the terminal 21 through the Internet 18 based on the determination result by the OCR determination unit 35. Display the company name or logo. As a result, not only analog advertisement on the surface of the cover 12 but also digital advertisement effect can be obtained at the same time.

四隅のガイドマーク２９の読み取りデータに基づき、ノート１５の用紙サイズや等級（テキストデータ化提供速度、品質を変えてもよい）をアプリケーション２０にて認識する。すなわち、ヘッダの左隅に印刷された複数のガイドマーク２９の形状および配置と、フッタの左隅に印刷された複数のガイドマーク２９の形状および配置との組み合わせを、表１に示すガイドマーク２９の分類表に照らし合わせることで、ノート１５の用紙サイズと等級とを認識することができる。例えば、表１の分類表において、ヘッダ用が「１」でフッタ用が「３」の組み合わせの場合は、アイフォン用（通常価格）であることが判る（なお、その他の組み合わせは、表１を参照）。端末２１から送信されたデータに基づき、文字画像データを、部分文字画像データに分割する。 Based on the read data of the guide marks 29 at the four corners, the application 20 recognizes the paper size and grade of the notebook 15 (the text data provision speed and quality may be changed). That is, combinations of the shape and arrangement of the plurality of guide marks 29 printed at the left corner of the header and the shape and arrangement of the plurality of guide marks 29 printed at the left corner of the footer are classified into the guide marks 29 shown in Table 1. The paper size and grade of the notebook 15 can be recognized by checking against the table. For example, in the classification table of Table 1, if the combination for the header is “1” and the footer is “3”, it can be seen that the combination is for the iPhone (ordinary price). reference). Based on the data transmitted from the terminal 21, the character image data is divided into partial character image data.

次に、端末２１より送信された６桁の数字からなるフッタコード２７の読み取りデータから、広告コードと機能コードと文字画像データとを取得する。フッタコード２７は、左から３ケタまでが広告（企業）コード（０００；広告なし、００１；第１の自社広告、００２；第２の自社広告）、左から４桁目が機能コード（０；機能なし、１；カレンダー、２；タグ、カレンダー）、左から５桁目が文字画像データの分割方法のコード（０；現在の固定座標分割、１；マニュアル分割）、左から６桁目（右端）が検査数字であるチェックデジットのコードである。チェックデジットの計算方法は、広告コードの各桁の数字（５５５）を足し（５＋５＋５＝１５）、機能コードの数字（１）を２倍し（１×２＝２）、分割コードの数字（６）を３倍し（６×３＝１８）、これらを合算して得た総数（１５＋２＋１８＝３５）を１０で除算した答えの余り（３５÷１０＝３…０５）「５」をチェックデジットとする。 Next, the advertisement code, the function code, and the character image data are acquired from the read data of the footer code 27 consisting of a 6-digit number transmitted from the terminal 21. The footer code 27 includes an advertisement (company) code (000; no advertisement, 001; first house advertisement, 002; second house advertisement) from the left to the third digit, and a function code (0; No function, 1; calendar, 2; tag, calendar), the fifth digit from the left is the code of the character image data division method (0; current fixed coordinate division, 1; manual division), the sixth digit from the left (right end) ) Is a check digit code which is a check digit. The check digit is calculated by adding the number (555) of each digit of the advertisement code (5 + 5 + 5 = 15), doubling the number (1) of the function code (1 × 2 = 2), and the number of divided codes (6 ) Is multiplied by 3 (6 × 3 = 18), and the total number (15 + 2 + 18 = 35) obtained by adding these is divided by 10 (35 ÷ 10 = 3... 05) “5” is the check digit. To do.

その後、図５および図７のフローシートに示すように、ＯＣＲ部２８による記入日２６と、端末側のＯＣＲ部５２によるフッタコード２７と、手書き文字の本文とのテキストデータ化が正しく行われたか否かを２人のＯＣＲ判定オペレータα，βが判定する。その後、２人のＯＣＲ判定オペレータα，βの各オペレータ端末２１α，２１βから１つずつ送信された判定結果としての記入日２６およびフッタコード２７と、手書き文字の本文のテキストデータが一致しているか否かをＯＣＲ判定部３５が判定し、これらが一致している場合には後述するチェック専用のオペレータＥによるチェックが行われる一方、一致していない場合には、選出された１つの部分文字画像データについて、第１〜第４のオペレータＡ〜Ｄおよびチェック専用のオペレータＥを介在させるデータエントリ方式の読み取り工程が実施される。 Thereafter, as shown in the flow sheets of FIGS. 5 and 7, whether the date of entry 26 by the OCR unit 28, the footer code 27 by the OCR unit 52 on the terminal side, and the body text of the handwritten characters have been correctly converted into text data. The two OCR determination operators α and β determine whether or not. After that, whether the entry date 26 and the footer code 27 as the determination results transmitted one by one from the operator terminals 21α and 21β of the two OCR determination operators α and β match the text data of the handwritten character body. The OCR determination unit 35 determines whether or not, and if they match, a check by a dedicated operator E described later is performed. If they do not match, one selected partial character image A data entry type reading process is performed on the data by interposing the first to fourth operators A to D and the operator E dedicated to checking.

具体的には、１人目の読み取りオペレータＡのオペレータ端末２１Ａから送信された読み取り結果としての部分テキストデータと、２人目の読み取りオペレータＢのオペレータ端末２１Ｂから送信された読み取り結果としての部分テキストデータとが一致しているか否かを第１の部分判定部３６により判定する。一致している場合には、その共通する部分テキストデータがチェック専用のオペレータＥのオペレータ端末２１Ｅに送信される。一方、一致していない場合（不一致率は５％程度）には、３人目の読み取りオペレータＣのオペレータ端末２１Ｃから送信された読み取り結果としての部分テキストデータと、１人目の読み取りオペレータＡのオペレータ端末２１Ａから送信された部分テキストデータとが一致しているか否か、および、３人目の部分テキストデータと２人目の読み取りオペレータＢのオペレータ端末２１Ｂから送信された部分テキストデータとが一致しているか否かを、第２の部分判定部３７により判定する。 Specifically, partial text data as a reading result transmitted from the operator terminal 21A of the first reading operator A, and partial text data as a reading result transmitted from the operator terminal 21B of the second reading operator B Are determined by the first partial determination unit 36. If they match, the common partial text data is transmitted to the operator terminal 21E of the operator E dedicated to checking. On the other hand, if they do not match (the mismatch rate is about 5%), the partial text data as the reading result transmitted from the operator terminal 21C of the third reading operator C and the operator terminal of the first reading operator A Whether or not the partial text data transmitted from 21A matches, and whether or not the third partial text data matches the partial text data transmitted from the operator terminal 21B of the second reading operator B Is determined by the second partial determination unit 37.

第２の部分判定部３７での判定が一致している場合には、前記チェック専用のオペレータ端末２１Ｅに、その共通する部分テキストデータが送信される。一方、第２の部分判定部３７による判定で何れも一致していないという判定結果が得られた場合（不一致率は１％程度）には、４人目の読み取りオペレータＤのオペレータ端末２１Ｄから、サーバ２２の記憶部３３に送信された読み取り結果としての部分テキストデータが、この部分テキスト判定部４０における一致した判定結果とみなし、チェック専用のオペレータ端末２１Ｅに、その見なされた部分テキストデータが送信される。
このように、選出した１つの部分文字画像データに対応して得られた部分テキストデータは、順次、サーバ２２の記憶部３３に記憶される。このとき、記憶部３３に記憶される部分テキストデータは、第１の部分判定部３６が一致していると判定した部分テキストデータか、第１の部分判定部３６が一致していないと判定した後に、第２の部分判定部３７が一致していると判定した部分テキストデータか、第２の部分判定部３７が一致していないと判定した後に、４人目の読み取りオペレータＤのオペレータ端末２１Ｄからサーバ２２の記憶部３３に送信された部分テキストデータの何れかである。これにより、ＯＣＲ部２８よりも高精度な手書き文字のテキストデータ化処理が可能となる。 If the determinations in the second partial determination unit 37 match, the common partial text data is transmitted to the operator terminal 21E dedicated to the check. On the other hand, when a determination result that none of the two matches is obtained by the determination by the second partial determination unit 37 (the mismatch rate is about 1%), the server terminal 21D of the fourth reading operator D receives the server The partial text data as a reading result transmitted to the storage unit 33 of 22 is regarded as a matching determination result in the partial text determination unit 40, and the regarded partial text data is transmitted to the operator terminal 21E dedicated to checking. The
Thus, the partial text data obtained corresponding to the selected partial character image data is sequentially stored in the storage unit 33 of the server 22. At this time, the partial text data stored in the storage unit 33 is determined to be the partial text data determined by the first partial determination unit 36 to match or the first partial determination unit 36 does not match. Later, it is determined from the operator terminal 21D of the fourth reading operator D after determining that the second partial determination unit 37 matches the partial text data or the second partial determination unit 37 does not match. One of the partial text data transmitted to the storage unit 33 of the server 22. As a result, it is possible to process the text data of handwritten characters with higher accuracy than the OCR unit 28.

その後、全ての部分画像データのテキスト化が終了したか否かが判断され、終了している場合には、記憶部３３に記憶された文字画像データの全体分の全ての部分テキストデータを、テキスト作成部４１により、複数の部分文字画像データに文字画像データを分割した配列に則して繋げて、テキストデータを作成する。これにより、ＯＣＲ部２８よりも高精度な手書き文字のテキストデータ化処理が可能となる。
以上説明したテキストデータ化処理部２４によって文字画像データのテキストデータ化処理を行うように構成したので、手書き文字のテキストデータ化処理を高精度で行うことができる。しかも、画像分割部３４により文字画像データを複数の部分文字画像データに分割することで、文字画像データの全体を各オペレータＡ〜Ｄが目視することができない。これにより、情報漏洩を防止することができる。 Thereafter, it is determined whether or not all the partial image data has been converted to text. If it has been completed, all the partial text data for the entire character image data stored in the storage unit 33 is converted into text. The creation unit 41 creates text data by connecting the character image data to a plurality of partial character image data according to an array obtained by dividing the character image data. As a result, it is possible to process the text data of handwritten characters with higher accuracy than the OCR unit 28.
Since the text data conversion processing unit 24 described above performs the text data conversion processing of character image data, the text data conversion processing of handwritten characters can be performed with high accuracy. Moreover, by dividing the character image data into a plurality of partial character image data by the image dividing unit 34, the operators A to D cannot visually see the entire character image data. Thereby, information leakage can be prevented.

また、端末２１のＯＣＲ部５２によるフッタコード２７の読み取り結果に基づき、提携企業が特定される。そのため、この読み取り時にサーバ２２がインターネット１８を通して例えば提携企業のＵＲＬにアクセスし、提携企業のアイコンやバナーを端末２１のアプリケーション２０やクライアントのＰＣ画面上などに、文字画像データとテキストデータとに並べて表示を行うことができる。その結果、ノート１５の表紙１２の表面に印刷されたアナログな企業広告と、液晶画面上でのアイコンやバナーによるデジタルな企業広告とが連動した新しい広告方法を提供することができる。しかも、アクティベーションにスポンサー企業の広告や会社名ロゴをスプラッシュウィンドウとして表示してもよい。 Further, the affiliated company is specified based on the reading result of the footer code 27 by the OCR unit 52 of the terminal 21. Therefore, at the time of reading, the server 22 accesses, for example, the URL of the affiliated company through the Internet 18 and arranges the icon and banner of the affiliated company in the character image data and the text data on the application 20 of the terminal 21 or the client PC screen. Display can be made. As a result, it is possible to provide a new advertising method in which an analog corporate advertisement printed on the surface of the cover 12 of the notebook 15 is linked with a digital corporate advertisement using icons and banners on the liquid crystal screen. In addition, an advertisement of a sponsor company or a company name logo may be displayed as a splash window for activation.

この発明は、ノートに記入された手書き文字を、サーバのテキストデータ化処理部を利用してテキストデータ化する技術として有用である。 The present invention is useful as a technique for converting handwritten characters entered in a notebook into text data using a text data conversion processing unit of a server.

１０手書き文字テキスト化システム、
１４ＱＲコード（２次元コード）、
１５ノート、
１６ＣＭＯＳカメラ（デジタルカメラ）、
１７２次元コードリーダ、
１８インターネット（通信回線網）、
１９端末側の通信部、
２１端末、
２１Ａ〜２１Ｅオペレータ端末、
２２サーバ、
２３サーバ側の通信部
２４テキストデータ化処理部、
２５ノート認証部、
３３記憶部、
３４画像分割部、
４０部分テキスト判定部、
４１テキスト作成部、
Ａ１人目の読み取りオペレータ、
Ｂ２人目の読み取りオペレータ、
Ｃ３人目の読み取りオペレータ、
Ｄ４人目の読み取りオペレータ、
Ｓ手書きスペース。 10 Handwritten text conversion system,
14 QR code (two-dimensional code),
15 notes,
16 CMOS camera (digital camera),
17 2D code reader,
18 Internet (communication network),
19 Communication part on the terminal side,
21 terminals,
21A-21E operator terminal,
22 servers,
23 server side communication unit 24 text data conversion processing unit,
25 Note authentication part,
33 storage unit,
34 Image segmentation unit,
40 partial text determination unit,
41 Text creation part,
A First reading operator,
B Second reading operator,
C Third reading operator,
D 4th reading operator,
S Handwritten space.

Claims

A handwritten note having a handwriting space for characters and displaying a two-dimensional code for note authentication;
A terminal equipped with a digital camera that captures handwritten characters entered in the handwriting space, a two-dimensional code reader that reads the two-dimensional code, and a communication unit on the terminal side that uses a communication network;
Receives the two-dimensional code reading data by the two-dimensional code reader respectively transmitted from the communication unit on the terminal side and the character image data of the handwritten characters by the digital camera using the communication line network. A server equipped with a server-side communication unit;
A note authenticating unit that is provided in the server and that authenticates the use of the note by receiving the read data of the two-dimensional code, and obtains data on the number of times the text data conversion processing unit is used by the note; ,
A handwritten character text conversion system comprising: a text data conversion processing unit that reads the character image data and converts it into text data.

The text data conversion processing unit
An image dividing unit for dividing the character image data into a plurality of partial character image data;
Whether or not a plurality of partial text data as a result of reading the partial character image data transmitted one by one from each operator terminal of two or more reading operators that individually read one partial character image data match. A partial text determination unit for determining
A storage unit for storing the partial text data determined to match the partial text determination unit;
Text for creating the text data by connecting the partial text data for the entire text data stored in the storage unit in accordance with an array obtained by dividing the character image data into the plurality of partial character image data The handwritten character text conversion system according to claim 1, further comprising a creation unit.

The partial text determination unit
Whether or not the partial text data as the reading result transmitted from the operator terminal of the first reading operator matches the partial text data as the reading result transmitted from the operator terminal of the second reading operator A first partial determination unit for determining
Partial text data as a reading result transmitted from the operator terminal of the third reading operator when the determination result of the first partial determination unit is inconsistent, and transmitted from the operator terminal of the first reading operator A process for determining whether or not the partial text data matches, and whether or not the third partial text data matches the partial text data transmitted from the operator terminal of the second reading operator. 2 partial determination units,
The partial text data stored in the storage unit is the partial text data determined by the first partial determination unit to match, or after determining that the first partial determination unit does not match, The handwritten character text conversion system according to claim 2, wherein the second partial determination unit is partial text data determined to match.