JP5717831B2 - Electronic device and handwritten document processing method

Info

Publication number: JP5717831B2
Application number: JP2013254660A
Authority: JP
Inventors: 千加志杉浦
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2013-12-10
Filing date: 2013-12-10
Publication date: 2015-05-13
Anticipated expiration: 2032-10-26
Also published as: JP2014099182A

Description

Embodiments of the present invention relate to the processing of handwritten documents.

In recent years, various electronic devices such as tablets, PDAs, and smartphones have been developed. Many of these devices have a touch screen display to make input operations easier for the user.

By touching a menu or object displayed on the touch screen display with a finger or the like, the user can instruct the electronic device to execute the function associated with that menu or object.

Some of these electronic devices provide a function that lets the user handwrite characters, figures, and the like on the touch screen display. A handwritten document (handwritten page) containing such handwritten characters and figures is saved and viewed as needed.

Meanwhile, optical character recognition (OCR) is in use: characters handwritten on a paper page, such as a notebook page, are scanned to generate image data of the page, and the handwritten characters are recognized from that image data. This technique converts handwritten characters into character codes.

JP 2003-99713 A

In OCR, handwritten characters are recognized, for example, in order from the upper left of the scanned handwritten document, and the character code of each character is output. As a result, the character codes are displayed on the screen as the recognition result, arranged in the order in which they were output.

However, characters in a handwritten document may be written at positions the user chose deliberately, so that they are seen as groups such as paragraphs, bulleted lists, or headings. It may therefore be expected that not only the characters themselves but also their arrangement is recognized.

An object of one embodiment of the present invention is to provide an electronic device and a handwritten document processing method capable of converting a handwritten document that includes handwritten characters into a formatted document that includes character codes.

According to an embodiment, an electronic device includes acquisition means and display control means. The acquisition means can acquire, using data of a handwritten document that includes a plurality of handwritten characters corresponding to a plurality of lines, the character codes of those handwritten characters. When a first condition is satisfied, the display control means can display, using a plurality of first character codes corresponding to a first line and a plurality of second character codes corresponding to a second line, first formatted document data that includes a character corresponding to at least one of the first character codes at a position corresponding to the second line, or a character corresponding to at least one of the second character codes at a position corresponding to the first line. When the first condition is not satisfied, the display control means can display, using the first character codes and the second character codes, second formatted document data that includes the characters corresponding to the first character codes at positions corresponding to the first line and the characters corresponding to the second character codes at positions corresponding to the second line. Whether the first condition is satisfied is determined using at least one of the following: (1) whether the character code corresponding to the first character of the first line matches or differs from the character code corresponding to the first character of the second line; (2) whether the character code corresponding to the first character of the first line and the character code corresponding to the first character of the second line correspond to a symbol used in bulleted lists; (3) in the case of horizontal writing, the relationship between the horizontal position of the first line and the horizontal position of the second line; (4) in the case of vertical writing, the relationship between the vertical position of the first line and the vertical position of the second line; (5) whether the characters corresponding to the first character codes and the characters corresponding to the second character codes correspond to source code; and (6) whether the characters corresponding to the first character codes and the characters corresponding to the second character codes correspond to a mathematical formula.
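The cues listed above can be combined in many ways, and the text here does not say which outcome of each cue satisfies the first condition. The following is a minimal sketch of one possible policy that uses only cues (2) and (3) for horizontal writing. The names `RecognizedLine`, `BULLET_SYMBOLS`, `first_condition_met`, and the `tolerance` parameter are all hypothetical, and the decision rule itself is an assumption made for illustration, not the method defined by the patent.

```python
from dataclasses import dataclass

# Assumed set of symbols that commonly introduce a bulleted item.
BULLET_SYMBOLS = {"・", "-", "*", "•", "○"}


@dataclass
class RecognizedLine:
    text: str    # characters for the recognized character codes of one line
    left: float  # horizontal start position of the line (horizontal writing)


def first_condition_met(first: RecognizedLine, second: RecognizedLine,
                        tolerance: float = 10.0) -> bool:
    """Assumed policy: True means the two lines may be reflowed into each other."""
    if not first.text or not second.text:
        return False
    # Cue (2): a leading bullet symbol on either line suggests separate list items.
    if first.text[0] in BULLET_SYMBOLS or second.text[0] in BULLET_SYMBOLS:
        return False
    # Cue (3): a continuation line starts at or to the left of the first line's
    # start; a line indented further to the right is treated as a new block.
    return second.left <= first.left + tolerance


paragraph = first_condition_met(RecognizedLine("At today's meeting we", 20.0),
                                RecognizedLine("agreed on the schedule.", 0.0))
bullets = first_condition_met(RecognizedLine("- item one", 30.0),
                              RecognizedLine("- item two", 30.0))
print(paragraph, bullets)
```

Under this assumed policy, the two paragraph lines qualify for reflowing while the two bulleted lines keep their own positions.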

FIG. 1 is a perspective view showing the external appearance of an electronic device according to an embodiment.
FIG. 2 is a diagram showing an example of a handwritten document processed by the electronic device of the embodiment.
FIG. 3 is a diagram for explaining time-series information that corresponds to the handwritten document of FIG. 2 and is stored in a storage medium by the electronic device of the embodiment.
FIG. 4 is a block diagram showing the system configuration of the electronic device of the embodiment.
FIG. 5 is a block diagram showing the functional configuration of a digital notebook application program executed by the electronic device of the embodiment.
FIG. 6 is a diagram showing an example in which a handwritten document undergoes character recognition.
FIG. 7 is a diagram showing an example in which the handwritten document of FIG. 6 is converted by the electronic device of the embodiment into a formatted document that includes character codes.
FIG. 8 is a diagram for explaining the lines recognized from the handwritten document of FIG. 7.
FIG. 9 is a diagram for explaining the characters recognized from the handwritten document of FIG. 7.
FIG. 10 is a diagram for explaining the groups recognized from the handwritten document of FIG. 7.
FIG. 11 is a diagram showing an example in which a handwritten document containing a table undergoes character recognition.
FIG. 12 is a diagram for explaining the groups recognized from the handwritten document of FIG. 11.
FIG. 13 is a diagram showing an example in which the handwritten document of FIG. 11 is converted by the electronic device of the embodiment into a formatted document that includes character codes.
FIG. 14 is a flowchart showing an example of the procedure of handwriting input processing executed by the electronic device of the embodiment.
FIG. 15 is a flowchart showing an example of the procedure of handwritten document conversion processing executed by the electronic device of the embodiment.

Embodiments are described below with reference to the drawings.
FIG. 1 is a perspective view showing the external appearance of an electronic device according to one embodiment. The electronic device is, for example, a pen-based portable electronic device that accepts handwritten input by pen or finger. It can be realized as a tablet computer, a notebook personal computer, a smartphone, a PDA, or the like. The following description assumes that the electronic device is realized as a tablet computer 10. The tablet computer 10 is a portable electronic device, also called a tablet or slate computer, and includes a main body 11 and a touch screen display 17, as shown in FIG. 1. The touch screen display 17 is mounted so as to overlie the upper surface of the main body 11.

The main body 11 has a thin, box-shaped housing. The touch screen display 17 incorporates a flat panel display and a sensor configured to detect the position at which a pen or finger contacts the screen of the flat panel display. The flat panel display may be, for example, a liquid crystal display (LCD). The sensor may be, for example, a capacitive touch panel or an electromagnetic induction digitizer. The following description assumes that both types of sensor, a digitizer and a touch panel, are incorporated in the touch screen display 17.

The digitizer and the touch panel are each provided so as to cover the screen of the flat panel display. The touch screen display 17 can detect not only touch operations performed on the screen with a finger but also touch operations performed with a pen 100. The pen 100 may be, for example, an electromagnetic induction pen.

The user can perform a handwriting input operation on the touch screen display 17 using an external object (the pen 100 or a finger). During the handwriting input operation, the trajectory of the external object's movement on the screen, that is, the trajectory (handwriting) of each stroke written by the operation, is drawn in real time, so that the trajectory of each stroke is displayed on the screen. The trajectory of the external object's movement while it remains in contact with the screen corresponds to one stroke. A set of many strokes corresponding to handwritten characters, figures, and the like, that is, a set of many trajectories (handwriting), constitutes a handwritten document.

In this embodiment, the handwritten document is stored in a storage medium not as image data but as handwritten document data that includes time-series information indicating the coordinate sequence of each stroke's trajectory and the order relationship between strokes. The time-series information is described in detail later with reference to FIG. 3; in general, it means a set of time-series stroke data corresponding to a plurality of strokes. Each stroke data item may be any data that can represent one stroke that can be input by handwriting; for example, it includes a coordinate data series (time-series coordinates) corresponding to the points on the stroke's trajectory. The order in which the stroke data items are arranged corresponds to the order in which the strokes were handwritten, that is, the stroke order.

The tablet computer 10 can read any existing handwritten document data from the storage medium and display on the screen the handwritten document corresponding to that data, that is, a handwritten document in which the trajectories corresponding to the strokes indicated by the time-series information are drawn.

Next, the relationship between the strokes handwritten by the user (characters, marks, figures, tables, and the like) and the time-series information is described with reference to FIGS. 2 and 3. FIG. 2 shows an example of a handwritten document (handwritten character string) written on the touch screen display 17 using the pen 100 or the like.

In a handwritten document, further characters or figures are often handwritten over characters or figures that have already been written. FIG. 2 assumes a case in which the handwritten character string "ABC" is written in the order "A", "B", "C", after which a handwritten arrow is written immediately next to the handwritten character "A".

The handwritten character "A" is represented by two strokes (a "∧"-shaped trajectory and a "-"-shaped trajectory) written using the pen 100 or the like, that is, by two trajectories. The "∧"-shaped trajectory of the pen 100, which is written first, is sampled in real time, for example at equal time intervals, yielding the time-series coordinates SD11, SD12, ..., SD1n of the "∧"-shaped stroke. Similarly, the "-"-shaped trajectory of the pen 100, which is written next, is also sampled, yielding the time-series coordinates SD21, SD22, ..., SD2n of the "-"-shaped stroke.
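Sampling a pen trajectory at equal time intervals can be sketched as follows. This is an illustration only: the function `sample_stroke`, the 10 ms period, and the synthetic pen motion `horizontal_bar` are hypothetical and are not taken from the embodiment.

```python
from typing import Callable, List, Tuple

Point = Tuple[float, float]


def sample_stroke(pen_position: Callable[[int], Point],
                  duration_ms: int, period_ms: int = 10) -> List[Point]:
    """Read the pen position every period_ms while the pen touches the screen."""
    return [pen_position(t) for t in range(0, duration_ms + 1, period_ms)]


# Hypothetical pen motion: a straight "-" stroke moving 1 unit per millisecond.
def horizontal_bar(t_ms: int) -> Point:
    return (100.0 + t_ms, 50.0)


# Time-series coordinates of the "-" stroke, playing the role of SD21, SD22, ..., SD2n.
sd2 = sample_stroke(horizontal_bar, duration_ms=50)
print(len(sd2), sd2[0], sd2[-1])
```

Because the sampling period is fixed, a slowly drawn stroke yields more coordinates than a quickly drawn one of the same length.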

The handwritten character "B" is represented by two strokes written using the pen 100 or the like, that is, by two trajectories. The handwritten character "C" is represented by one stroke written using the pen 100 or the like, that is, by one trajectory. The handwritten arrow is represented by two strokes written using the pen 100 or the like, that is, by two trajectories.

FIG. 3 shows time-series information 200 corresponding to the handwritten document of FIG. 2. The time-series information includes a plurality of stroke data items SD1, SD2, ..., SD7. Within the time-series information 200, the stroke data SD1, SD2, ..., SD7 are arranged chronologically in handwriting order, that is, in the order in which the strokes were handwritten.

In the time-series information 200, the first two stroke data items SD1 and SD2 represent the two strokes of the handwritten character "A". The third and fourth stroke data items SD3 and SD4 represent the two strokes that make up the handwritten character "B". The fifth stroke data item SD5 represents the single stroke that makes up the handwritten character "C". The sixth and seventh stroke data items SD6 and SD7 represent the two strokes that make up the handwritten arrow.

Each stroke data item includes a coordinate data series (time-series coordinates) corresponding to one stroke, that is, a plurality of coordinates corresponding to a plurality of points on the trajectory of that stroke. Within each stroke data item, the coordinates are arranged chronologically in the order in which the stroke was written. For the handwritten character "A", for example, the stroke data SD1 includes the coordinate data series (time-series coordinates) corresponding to the points on the trajectory of the "∧"-shaped stroke, that is, n coordinate data items SD11, SD12, ..., SD1n. The stroke data SD2 includes the coordinate data series corresponding to the points on the trajectory of the "-"-shaped stroke, that is, n coordinate data items SD21, SD22, ..., SD2n. The number of coordinate data items may differ from one stroke data item to another.

Each coordinate data item indicates the X coordinate and Y coordinate of one point on the corresponding trajectory. For example, the coordinate data SD11 indicates the X coordinate (X11) and Y coordinate (Y11) of the start point of the "∧"-shaped stroke. SD1n indicates the X coordinate (X1n) and Y coordinate (Y1n) of the end point of the "∧"-shaped stroke.
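The structure described so far, an ordered list of stroke data items that each hold an ordered list of (X, Y) coordinates, can be sketched as below. The class names `Stroke` and `TimeSeriesInfo` and all coordinate values are hypothetical; only the stroke counts per symbol (two for "A", two for "B", one for "C", two for the arrow) follow the description of FIG. 2 and FIG. 3.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Stroke:
    """One stroke: coordinates sampled while the pen stayed on the screen."""
    points: List[Tuple[float, float]] = field(default_factory=list)  # (X, Y), in sampling order


@dataclass
class TimeSeriesInfo:
    """Ordered stroke data; list order is the order of handwriting."""
    strokes: List[Stroke] = field(default_factory=list)

    def add(self, stroke: Stroke) -> None:
        self.strokes.append(stroke)


# Placeholder coordinates for the page of FIG. 2.
info = TimeSeriesInfo()
strokes_per_symbol = [("A", 2), ("B", 2), ("C", 1), ("arrow", 2)]
for symbol_index, (_, count) in enumerate(strokes_per_symbol):
    for stroke_index in range(count):
        x0 = 10.0 * symbol_index
        y0 = float(stroke_index)
        info.add(Stroke(points=[(x0, y0), (x0 + 5.0, y0 + 5.0)]))

print(len(info.strokes))  # seven stroke data items, corresponding to SD1 to SD7
```

Keeping the strokes in a plain list means the stroke order is carried by position alone, with no separate index field.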

Each coordinate data item may further include time stamp information T corresponding to the time at which the point for that coordinate was handwritten. This time may be either an absolute time (for example, year, month, day, hour, minute, and second) or a relative time measured from some reference time. For example, the absolute time at which a stroke was begun (for example, year, month, day, hour, minute, and second) may be added to each stroke data item as time stamp information, and a relative time indicating the difference from that absolute time may be added to each coordinate data item in the stroke data as time stamp information T.

Using time-series information in which time stamp information T has been added to each coordinate data item in this way makes it possible to represent the temporal relationship between strokes more accurately.
Information (Z) indicating writing pressure may also be added to each coordinate data item.
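The time stamp scheme described above, an absolute start time per stroke plus a relative offset per coordinate, can be sketched as follows, together with an optional pressure value. The names `CoordinateData`, `StrokeData`, `t_ms`, and `gap_between`, the millisecond unit, and the sample values are assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class CoordinateData:
    x: float
    y: float
    t_ms: int = 0              # relative time T: milliseconds since the stroke began
    z: Optional[float] = None  # writing pressure Z, if the sensor reports it


@dataclass
class StrokeData:
    started_at: datetime       # absolute time at which the stroke was begun
    points: List[CoordinateData]

    def absolute_time(self, index: int) -> datetime:
        """Absolute time of one point: stroke start plus the point's offset."""
        return self.started_at + timedelta(milliseconds=self.points[index].t_ms)


def gap_between(first: StrokeData, second: StrokeData) -> timedelta:
    """Pause between the end of one stroke and the start of the next."""
    return second.started_at - first.absolute_time(len(first.points) - 1)


start = datetime(2012, 3, 28, 10, 0, 0)
sd1 = StrokeData(start, [CoordinateData(0, 0, 0, 0.4),
                         CoordinateData(5, 10, 10, 0.6),
                         CoordinateData(10, 0, 20, 0.5)])
sd2 = StrokeData(start + timedelta(milliseconds=500),
                 [CoordinateData(2, 5, 0), CoordinateData(8, 5, 10)])
print(gap_between(sd1, sd2))
```

A downstream step could use such pauses as one hint for where a character or line ends, although the text here does not say that the embodiment does so.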

Furthermore, in this embodiment, as described above, the handwritten document is stored not as an image or as a character recognition result but as the time-series information 200, which consists of a set of time-series stroke data. Handwritten characters and figures can therefore be handled independently of language, and the structure of the time-series information 200 of this embodiment can be used in common in countries around the world that use different languages.

FIG. 4 is a diagram showing the system configuration of the tablet computer 10.
As shown in FIG. 4, the tablet computer 10 includes a CPU 101, a system controller 102, a main memory 103, a graphics controller 104, a BIOS-ROM 105, a nonvolatile memory 106, a wireless communication device 107, an embedded controller (EC) 108, and the like.

The CPU 101 is a processor that controls the operation of the various modules in the tablet computer 10. The CPU 101 executes various software loaded into the main memory 103 from the nonvolatile memory 106, which is a storage device. This software includes an operating system (OS) 201 and various application programs. The application programs include a digital notebook application program 202. The digital notebook application program 202 has, among others, a function for creating and displaying the handwritten documents described above and a function for converting a handwritten document into a formatted document that includes character codes.

The CPU 101 also executes a basic input/output system (BIOS) stored in the BIOS-ROM 105. The BIOS is a program for hardware control.

The system controller 102 is a device that connects the local bus of the CPU 101 to the various components. The system controller 102 also incorporates a memory controller that controls access to the main memory 103. In addition, the system controller 102 has a function for communicating with the graphics controller 104 via, for example, a serial bus conforming to the PCI EXPRESS standard.

The graphics controller 104 is a display controller that controls an LCD 17A used as the display monitor of the tablet computer 10. A display signal generated by the graphics controller 104 is sent to the LCD 17A, which displays a screen image based on the display signal. A touch panel 17B and a digitizer 17C are arranged on the LCD 17A. The touch panel 17B is a capacitive pointing device for performing input on the screen of the LCD 17A. The touch panel 17B detects the position on the screen touched by a finger, the movement of that position, and the like. The digitizer 17C is an electromagnetic induction pointing device for performing input on the screen of the LCD 17A. The digitizer 17C detects the position on the screen touched by the pen 100, the movement of that position, and the like.

The wireless communication device 107 is a device configured to perform wireless communication such as wireless LAN or 3G mobile communication. The EC 108 is a one-chip microcomputer that includes an embedded controller for power management. The EC 108 has a function for powering the tablet computer 10 on or off in response to the user's operation of the power button.

Next, the functional configuration of the digital notebook application program 202 is described with reference to FIG. 5. The digital notebook application program 202 creates, displays, and edits handwritten documents using stroke data input by handwriting input operations on the touch screen display 17. The digital notebook application program 202 can also format a handwritten document; that is, it can convert the handwritten characters in the handwritten document into character codes and generate formatted document data laid out according to the character sizes and arrangement in the handwritten document.

The digital notebook application program 202 includes, for example, a trajectory display processing unit 301, a time-series information generation unit 302, a line recognition unit 303, a character recognition unit 304, a character group recognition unit 305, a formatted document generation unit 306, a page storage processing unit 307, a page acquisition processing unit 308, and a document display processing unit 309.

The touch screen display 17 is configured to detect the occurrence of events such as "touch", "move (slide)", and "release". "Touch" is an event indicating that an external object has come into contact with the screen. "Move (slide)" is an event indicating that the contact position has moved while the external object is in contact with the screen. "Release" is an event indicating that the external object has been lifted from the screen.

The trajectory display processing unit 301 and the time-series information generation unit 302 receive the "touch" or "move (slide)" events generated by the touch screen display 17 and thereby detect a handwriting input operation. A "touch" event includes the coordinates of the contact position. A "move (slide)" event includes the coordinates of the contact position after the move. The trajectory display processing unit 301 and the time-series information generation unit 302 can therefore receive, from the touch screen display 17, a coordinate sequence corresponding to the trajectory of the contact position's movement.

The trajectory display processing unit 301 receives the coordinate sequence from the touch screen display 17 and, based on it, displays the trajectory of each stroke handwritten with the pen 100 or the like on the screen of the LCD 17A in the touch screen display 17. The trajectory display processing unit 301 thus draws on the screen of the LCD 17A the trajectory of the pen 100 while it is in contact with the screen, that is, the trajectory of each stroke.

The time-series information generation unit 302 receives the coordinate sequence output from the touch screen display 17 and, based on it, generates time-series information (stroke data) having the structure described in detail with reference to FIG. 3. The time-series information, that is, the coordinates and time stamp information corresponding to each point of a stroke, may be stored temporarily in a work memory 401.
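How "touch", "move (slide)", and "release" events might be turned into stroke data can be sketched as follows. The class `StrokeRecorder` and its method names are hypothetical, and closing a stroke on "release" is an assumption that follows from the definition of a stroke as the trajectory traced while the external object stays in contact with the screen.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]


class StrokeRecorder:
    """Collects coordinates from touch events into strokes, in handwriting order."""

    def __init__(self) -> None:
        self.strokes: List[List[Point]] = []          # finished strokes
        self._current: Optional[List[Point]] = None   # stroke being written, if any

    def on_touch(self, x: float, y: float) -> None:
        # The external object touched the screen: a new stroke begins here.
        self._current = [(x, y)]

    def on_move(self, x: float, y: float) -> None:
        # The contact position moved: extend the stroke being written.
        if self._current is not None:
            self._current.append((x, y))

    def on_release(self) -> None:
        # The external object left the screen: the stroke is complete.
        if self._current is not None:
            self.strokes.append(self._current)
            self._current = None


recorder = StrokeRecorder()
recorder.on_touch(0, 0)
recorder.on_move(5, 10)
recorder.on_move(10, 0)
recorder.on_release()
recorder.on_touch(2, 5)
recorder.on_move(8, 5)
recorder.on_release()
print(recorder.strokes)
```

Move events that arrive with no stroke in progress are ignored, so a stray event cannot corrupt the recorded order.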

The page storage processing unit 307 stores the generated time-series information (the time-series information temporarily stored in the work memory 401) in a storage medium 402 as handwritten document data. The storage medium 402 is, for example, a storage device in the tablet computer 10.

The page acquisition processing unit 308 reads any handwritten document data already stored in the storage medium 402. The handwritten document data that has been read is sent to the document display processing unit 309. The document display processing unit 309 analyzes the handwritten document data and, based on the result of the analysis, displays the trajectory of each stroke indicated by the time-series information on the screen as a handwritten document (handwritten page).
With the configuration described above, the user can create and view handwritten documents that include handwritten characters.
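Saving a page as stroke data and reading it back can be sketched as below. The JSON layout, the functions `save_page` and `load_page`, and the file name are assumptions made for illustration; the text does not specify the on-disk format of the handwritten document data.

```python
import json
import os
import tempfile
from typing import List, Tuple

Point = Tuple[float, float]


def save_page(path: str, strokes: List[List[Point]]) -> None:
    """Write the strokes, in handwriting order, as handwritten document data."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"strokes": strokes}, f)


def load_page(path: str) -> List[List[Point]]:
    """Read the strokes back; JSON stores tuples as lists, so convert them again."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    return [[(p[0], p[1]) for p in stroke] for stroke in data["strokes"]]


page = [[(0.0, 0.0), (5.0, 10.0), (10.0, 0.0)], [(2.0, 5.0), (8.0, 5.0)]]
with tempfile.TemporaryDirectory() as folder:
    page_path = os.path.join(folder, "page1.json")
    save_page(page_path, page)
    restored = load_page(page_path)

print(restored == page)
```

Because the file holds strokes rather than pixels, the page can be redrawn at any size and the stroke order survives the round trip.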

A handwritten document that has been created can also be converted into character codes by character recognition. FIG. 6 shows an example in which a handwritten document (handwritten page) 61 undergoes character recognition. Character recognition converts the characters in the handwritten document 61 into character codes, which are output as a character recognition result 62. In the character recognition result 62, the recognized characters are arranged line by line as they appear in the handwritten document 61. For example, the one-line character string "まとめ" ("Summary") in the handwritten document 61 also appears in the character recognition result 62 as the one-line character string (character code string) "まとめ". By contrast, the sentence "今日の打合せで……下さい。" ("At today's meeting ... please.") is handwritten across several lines in the handwritten document 61. In the character recognition result 62, this sentence is broken at every line of the handwritten document 61 and is displayed in fragments even though it is a single sentence.

このように、手書き文書６１の文字認識だけでは、認識された文字（文字コード）が単に並べられるだけであり、手書き文書６１内の文字の位置や大きさ、段落のような構成に関する情報が失われてしまう。そのため、例えば、一つの文が途切れた状態で表示され、文書として利用しづらい可能性がある。また、そのような文書は、ユーザにとっても読みにくいものである可能性が高い。 Thus, character recognition of the handwritten document 61 alone merely lines up the recognized characters (character codes), and information about the structure of the handwritten document 61, such as the positions and sizes of the characters and the paragraphs, is lost. As a result, for example, a single sentence may be displayed in an interrupted state, which can make the result difficult to use as a document. Such a document is also likely to be difficult for the user to read.

そのため本実施形態では、図７に示すように、手書き文書６１に含まれる手書き文字に対応する文字コードを認識し、さらに、認識された文字コードの配置が整形された文書（以下、整形ページとも称する）６３を生成する。すなわち、本実施形態では、手書き文書６１を清書した文書を生成する。 Therefore, in the present embodiment, as shown in FIG. 7, character codes corresponding to the handwritten characters included in the handwritten document 61 are recognized, and a document 63 in which the arrangement of the recognized character codes has been formatted (hereinafter also referred to as a formatted page) is generated. That is, in the present embodiment, a fair copy of the handwritten document 61 is generated.

整形文書（整形ページ）６３では、「２０１２０３／２８」という文字列が、手書き文書６１上での位置に対応する位置に配置されている。整形文書６３では、手書き文書６１における、箇条書きされた２つの項目の先頭の字下げ（インデント）が維持されている。また、整形文書６３では、「今日の打合せで……下さい。」という一文が途切れないように、対応する文字が配置されている。さらに、整形文書６３では、手書き文書６１上での大きさに対応するサイズ（フォントサイズ）で、文字（文字コード）が表示されている。 In the formatted document (formatted page) 63, the character string “2012 03/28” is arranged at a position corresponding to its position on the handwritten document 61. The formatted document 63 preserves the indentation at the head of the two bulleted items in the handwritten document 61. Further, in the formatted document 63, the corresponding characters are arranged so that the sentence “In today's meeting ... please.” is not interrupted. In addition, the characters (character codes) in the formatted document 63 are displayed in a size (font size) corresponding to their size on the handwritten document 61.

このように本実施形態では、手書き文書６１上での文字の位置や大きさ、文字が属するグループ等の構成に関する情報を失うことなく、整形文書６３を生成する。このグループは、例えば、段落、箇条書き、見出し、表、数式のような１つのまとまりとして扱われるべき文字群が属するグループである。 As described above, in the present embodiment, the formatted document 63 is generated without losing information on the position and size of the character on the handwritten document 61 and the configuration of the group to which the character belongs. This group is a group to which a group of characters to be treated as one unit such as paragraphs, bullets, headings, tables, and mathematical formulas belongs.

以下では、時系列情報生成部３０２によって生成された時系列情報を含む手書き文書データを用いて、手書き文書を整形文書に変換する処理について説明する。 The following describes the process of converting a handwritten document into a formatted document using handwritten document data that includes the time-series information generated by the time-series information generation unit 302.

まず、行認識部３０３は、手書き文書のデータに含まれる複数の行を認識する。例えば、行認識部３０３は、手書き文書データを用いて、手書き文書６１上の１以上の手書き文字をそれぞれ含む複数の行を認識する。より具体的には、行認識部３０３は、手書き文書６１上に手書きされた複数のストロークに対応する複数のストロークデータを用いることにより、手書きされたストロークの座標の変化に基づいて、行を認識する。 First, the line recognition unit 303 recognizes a plurality of lines included in the handwritten document data. For example, the line recognition unit 303 uses the handwritten document data to recognize a plurality of lines, each including one or more handwritten characters, on the handwritten document 61. More specifically, the line recognition unit 303 uses a plurality of stroke data corresponding to the plurality of strokes handwritten on the handwritten document 61, and recognizes the lines based on changes in the coordinates of the handwritten strokes.

図８に示す例では、手書き文書６１上の８つの行６６１〜６６８が認識されている。この手書き文書６１のように、手書き文書６１内に文字が横書きされる場合、ユーザは、１つの行に属する文字を、手書きページ内の左から右に向かって手書きしていくことが想定される。そのため、行認識部３０３は、時系列情報（複数のストロークデータ）から、タッチスクリーンディスプレイ１７上でのオブジェクト（指またはペン１００）の接触位置が、ある行の末尾から次の行の先頭に移動していることを示す座標を検出する。 In the example shown in FIG. 8, eight lines 661 to 668 on the handwritten document 61 are recognized. When characters are written horizontally, as in the handwritten document 61, the user is assumed to handwrite the characters belonging to one line from left to right in the handwritten page. Therefore, the line recognition unit 303 detects, from the time-series information (a plurality of stroke data), coordinates indicating that the contact position of the object (a finger or the pen 100) on the touch screen display 17 has moved from the end of one line to the beginning of the next line.

より具体的には、行認識部３０３は、手書きされた時刻順の座標データ系列を用いて、Ｘ座標（水平方向の座標）が手書き文書６１内の右から左へと大きく変化している、連続して手書きされた二つのストロークを検出する。行認識部３０３は、図８に示した例では、第Ｎストロークの最後の座標データＳＤ４ｎと、その第Ｎストロークに後続する、第（Ｎ＋１）ストロークの最初の座標データＳＤ５１とを用いて、座標データＳＤ４ｎのＸ座標と座標データＳＤ５１のＸ座標との差の絶対値がしきい値以上である場合に、第Ｎストロークまでの１以上のストロークと、第（Ｎ＋１）ストロークからの１以上のストロークとが、別々の行に属することを検出する。すなわち、第ＮストロークＳＤ４ｎまでの行６６５と、第（Ｎ＋１）ストロークからの行６６６とが検出される。 More specifically, the line recognition unit 303 uses the coordinate data series, ordered by handwriting time, to detect two consecutively handwritten strokes between which the X coordinate (horizontal coordinate) changes greatly from right to left in the handwritten document 61. In the example shown in FIG. 8, the line recognition unit 303 uses the last coordinate data SD4n of the Nth stroke and the first coordinate data SD51 of the (N + 1)th stroke that follows the Nth stroke. When the absolute value of the difference between the X coordinate of the coordinate data SD4n and the X coordinate of the coordinate data SD51 is greater than or equal to a threshold value, the line recognition unit 303 detects that the one or more strokes up to the Nth stroke and the one or more strokes from the (N + 1)th stroke belong to different lines. That is, the line 665 up to the Nth stroke SD4n and the line 666 from the (N + 1)th stroke are detected.
行認識部３０３は、同様にして、行の切れ目を検出することによって、手書き文書６１内の行６６１〜６６８を認識する。 Similarly, the line recognition unit 303 recognizes the lines 661 to 668 in the handwritten document 61 by detecting line breaks.
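
The line-break test described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Stroke` class, the function name `split_into_lines`, and the threshold value used in any call are hypothetical, and the sketch assumes horizontal writing with strokes supplied in handwriting order.

```python
from dataclasses import dataclass


@dataclass
class Stroke:
    # (x, y) samples in the order they were written (hypothetical structure)
    points: list[tuple[float, float]]


def split_into_lines(strokes, x_threshold):
    """Group time-ordered strokes into lines of horizontal writing.

    A new line starts when the X distance between the last point of one
    stroke and the first point of the next stroke is >= x_threshold.
    """
    lines = []
    current = []
    for stroke in strokes:
        if current:
            prev_end_x = current[-1].points[-1][0]
            start_x = stroke.points[0][0]
            # The text compares the absolute X difference with a threshold.
            if abs(prev_end_x - start_x) >= x_threshold:
                lines.append(current)
                current = []
        current.append(stroke)
    if current:
        lines.append(current)
    return lines
```

Because the comparison uses the absolute difference, a large left-to-right jump within a line would also be treated as a break; a signed comparison would be one way to restrict the test to right-to-left jumps.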

次いで、文字認識部３０４は、認識された複数の行に含まれる複数の手書き文字に対応する文字コードを認識する。つまり、文字認識部３０４は、複数の行の各々に含まれる手書き文字を文字認識することによって、それら手書き文字を文字コードに変換する。文字認識部３０４は、各行内の複数のストロークの内の１以上のストロークに対応する文字を認識する。 Next, the character recognition unit 304 recognizes character codes corresponding to a plurality of handwritten characters included in the recognized lines. That is, the character recognition unit 304 converts the handwritten characters into character codes by recognizing the handwritten characters included in each of the plurality of lines. The character recognition unit 304 recognizes a character corresponding to one or more strokes among a plurality of strokes in each line.

図９に示すように、手書き文書６１上の複数の行６６１〜６６８の各々に手書きされた複数のストロークは、認識された文字毎に、対応する１以上のストロークが対応付けられる。つまり、文字認識部３０４による文字認識の結果、複数のストロークが文字毎のブロック６５１，６５２，６５３，６５４，……，６５ｎに分割される。例えば、行６６１に手書きされた「２０１２０３／２８」に対応する複数のストロークは、「２」、「０」、「１」、「２」、「０」、「３」、「２」、および「８」に対応する、文字毎のブロックに分割される。 As shown in FIG. 9, the strokes handwritten in each of the lines 661 to 668 on the handwritten document 61 are associated with the recognized characters, so that each recognized character has one or more corresponding strokes. That is, as a result of character recognition by the character recognition unit 304, the strokes are divided into per-character blocks 651, 652, 653, 654, ..., 65n. For example, the strokes corresponding to “2012 03/28” handwritten in the line 661 are divided into per-character blocks corresponding to “2”, “0”, “1”, “2”, “0”, “3”, “2”, and “8”.

文字認識部３０４は、例えば、複数のストロークの内の、１以上のストローク（処理対象のストローク）に対応する１以上のストロークデータを用いて、その１以上のストロークの形状を示す第１特徴量を算出する。そして、文字認識部３０４は、記憶媒体４０２に予め格納された文字辞書データを用いて、算出された第１特徴量と類似する特徴量を有する文字を検出する。この文字辞書データでは、例えば、複数の文字と、それら複数の文字に対応する複数の特徴量とが規定されている。したがって、文字認識部３０４は、文字辞書データに規定された複数の文字から、算出された第１特徴量との類似度がしきい値以上である、第２特徴量を有する文字を認識することによって、処理対象のブロック内の手書き文字を文字コードに変換する。 For example, the character recognition unit 304 uses one or more stroke data corresponding to one or more strokes (the strokes to be processed) among the plurality of strokes to calculate a first feature amount indicating the shape of those strokes. The character recognition unit 304 then uses character dictionary data stored in advance in the storage medium 402 to detect a character having a feature amount similar to the calculated first feature amount. The character dictionary data defines, for example, a plurality of characters and a plurality of feature amounts corresponding to those characters. The character recognition unit 304 therefore converts the handwritten character in the block to be processed into a character code by recognizing, from among the characters defined in the character dictionary data, a character having a second feature amount whose similarity to the calculated first feature amount is greater than or equal to a threshold value.

なお、文字認識部３０４は、文字辞書データに規定された複数の文字から、第１特徴量との類似度がしきい値以上である特徴量を有する複数の文字候補を検出してもよい。その場合、文字認識部３０４は、例えば、単語や文字の共起確率等を示す言語辞書データと、処理対象のストロークの近傍のストローク（例えば、処理対象のストロークの左右のストローク）から認識された文字とに基づいて、検出された複数の文字候補から、処理対象のストロークに対して尤もらしい文字（尤度が高い文字）を絞り込む。これにより、文字認識部３０４は、処理対象のストロークに対応する文字を認識する。 Note that the character recognition unit 304 may detect a plurality of character candidates having a feature amount whose similarity to the first feature amount is greater than or equal to a threshold value from a plurality of characters defined in the character dictionary data. In that case, the character recognition unit 304 is recognized from, for example, language dictionary data indicating the co-occurrence probabilities of words and characters, and strokes in the vicinity of the processing target stroke (for example, the left and right strokes of the processing target stroke). Based on the characters, the likely characters (characters with high likelihood) are narrowed down from the detected plurality of character candidates with respect to the stroke to be processed. As a result, the character recognition unit 304 recognizes a character corresponding to the stroke to be processed.
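
The dictionary lookup described above might look like the following sketch. The text does not specify how the feature amount or the similarity is computed, so the numeric feature vectors and the cosine similarity used here are assumptions made purely for illustration; the function names are likewise hypothetical. The narrowing step that uses language dictionary data is not modelled.

```python
import math


def cosine_similarity(a, b):
    # Assumed similarity measure; the source only says "similarity".
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def candidate_characters(feature, dictionary, threshold):
    """Return (character, similarity) pairs at or above threshold, best first.

    dictionary maps each character to its reference feature vector
    (a hypothetical stand-in for the character dictionary data).
    """
    scored = [(ch, cosine_similarity(feature, ref)) for ch, ref in dictionary.items()]
    hits = [(ch, s) for ch, s in scored if s >= threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


def recognize(feature, dictionary, threshold):
    """Return the most similar character, or None if none reaches threshold."""
    hits = candidate_characters(feature, dictionary, threshold)
    return hits[0][0] if hits else None
```

When `candidate_characters` returns several entries, they correspond to the multiple character candidates that the text says may then be narrowed down using neighbouring characters.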

文字認識部３０４はさらに、認識した文字の大きさに基づいて、文字（文字コード）を表示すべきフォントサイズを算出する。文字認識部３０４は、例えば、行認識部３０３によって認識された複数の行の各々に含まれる複数の手書き文字の大きさ（例えば、複数の手書き文字の大きさの平均）に基づいて、その行の複数の文字コードを表示すべきフォントサイズを算出する。なお、文字認識部３０４は、複数の行の各々に含まれる複数の手書き文字の大きさが第１範囲内に収まっている場合（すなわち、大きさのばらつきが一定の範囲内に収まっている場合）に、複数の手書き文字の大きさの平均を、複数の文字コードを表示すべきフォントサイズに設定してもよい。文字認識部３０４は、例えば、図９に示した手書き文字「料」６５５の大きさと手書き文字「て」６５６の大きさとが第１範囲内に収まっていると判定し、これらの文字６５５，６５６に関連付ける１つのフォントサイズを算出する。 The character recognition unit 304 further calculates, based on the size of the recognized characters, the font size in which the characters (character codes) should be displayed. For example, based on the sizes of the handwritten characters included in each of the lines recognized by the line recognition unit 303 (for example, the average of those sizes), the character recognition unit 304 calculates the font size in which the character codes of that line should be displayed. Note that the character recognition unit 304 may set the average size of the handwritten characters as the font size for the character codes when the sizes of the handwritten characters included in a line fall within a first range (that is, when the variation in size stays within a certain range). For example, the character recognition unit 304 determines that the size of the handwritten character “料” 655 and the size of the handwritten character “て” 656 shown in FIG. 9 fall within the first range, and calculates one font size to be associated with these characters 655 and 656.

より具体的には、文字認識部３０４は、行認識部３０３によって認識された複数の行の内の第１行に含まれる複数の第１手書き文字の大きさに基づいて、第１フォントサイズを算出する。また、文字認識部３０４は、複数の行の内の第２行に含まれる複数の第２手書き文字の大きさに基づいて、第２フォントサイズを算出する。このような処理によって、文字認識部３０４は、文字コードを表示すべきフォントサイズを決定する。なお、文字認識部３０４は、行毎ではなく、後述するグループ毎にフォントサイズを決定してもよい。 More specifically, the character recognition unit 304 calculates a first font size based on the sizes of the plurality of first handwritten characters included in a first line among the lines recognized by the line recognition unit 303. The character recognition unit 304 also calculates a second font size based on the sizes of the plurality of second handwritten characters included in a second line among those lines. Through this processing, the character recognition unit 304 determines the font size in which the character codes should be displayed. Note that the character recognition unit 304 may determine the font size for each group, described later, instead of for each line.
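
A per-line font size calculation along these lines could be sketched as below. The function name, the use of character heights as the "size", the `max_spread` parameter standing in for the first range, and the `scale` factor are all assumptions. The text does not say what happens when the sizes fall outside the first range, so this sketch simply returns `None` in that case.

```python
from statistics import mean


def line_font_size(char_heights, max_spread, scale=1.0):
    """Return one font size for a line, or None when sizes vary too much.

    char_heights: heights of the handwritten characters in one line.
    max_spread:   largest allowed (max - min) difference, a hypothetical
                  stand-in for the "first range" in the text.
    scale:        hypothetical factor from handwriting units to font units.
    """
    if not char_heights:
        return None
    if max(char_heights) - min(char_heights) > max_spread:
        return None
    return mean(char_heights) * scale
```

Calling the function once per recognized line yields the first font size, the second font size, and so on; calling it once per group would give the per-group variant mentioned in the text.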

次いで、文字グループ認識部３０５および整形文書生成部３０６は、手書き文書６１内の構成を示すグループを認識し、認識されたグループに基づいて配置された文字コード（文字認識部３０４によって認識された文字コード）を含む整形文書データを生成する。文字グループ認識部３０５および整形文書生成部３０６は、例えば、行認識部３０３によって認識された複数の行の内の第１行と、当該第１行に後続する第２行とが第１条件を満たす場合に、第１行に対応する複数の第１文字コードと第２行に対応する複数の第２文字コードとを用いて、少なくとも１つの第１文字コードに対応する文字を第２行に対応する位置に含むか、または少なくとも１つの第２文字コードに対応する文字を第１行に対応する位置に含む第１整形文書データを生成可能である。したがって例えば、文字グループ認識部３０５および整形文書生成部３０６は、第１行と第２行とが第１条件を満たす場合、複数の第１文字コードと複数の第２文字コードとの間に改行コードを挿入せずに、第１整形文書データを生成する。また、文字グループ認識部３０５および整形文書生成部３０６は、第１行と第２行とが第１条件を満たさない場合、複数の第１文字コードと複数の第２文字コードとを用いて、複数の第１文字コードに対応する複数の文字を第１行に対応する位置に含み、複数の第２文字コードに対応する複数の文字を第２行に対応する位置に含む第２整形文書データを生成可能である。したがって例えば、文字グループ認識部３０５および整形文書生成部３０６は、第１行と第２行とが第１条件を満たさない場合、複数の第１文字コードと複数の第２文字コードとの間に改行コードを挿入して、第２整形文書データを生成する。 Next, the character group recognition unit 305 and the formatted document generation unit 306 recognize groups that indicate the structure of the handwritten document 61, and generate formatted document data that includes the character codes (the character codes recognized by the character recognition unit 304) arranged based on the recognized groups. For example, when a first line among the lines recognized by the line recognition unit 303 and a second line following the first line satisfy a first condition, the character group recognition unit 305 and the formatted document generation unit 306 can use the plurality of first character codes corresponding to the first line and the plurality of second character codes corresponding to the second line to generate first formatted document data that includes a character corresponding to at least one first character code at a position corresponding to the second line, or a character corresponding to at least one second character code at a position corresponding to the first line. Thus, for example, when the first line and the second line satisfy the first condition, the two units generate the first formatted document data without inserting a line feed code between the first character codes and the second character codes. When the first line and the second line do not satisfy the first condition, the two units can use the first character codes and the second character codes to generate second formatted document data that includes the characters corresponding to the first character codes at a position corresponding to the first line and the characters corresponding to the second character codes at a position corresponding to the second line. Thus, for example, when the first line and the second line do not satisfy the first condition, the two units insert a line feed code between the first character codes and the second character codes to generate the second formatted document data.

より具体的には、文字グループ認識部３０５は、行認識部３０３によって認識された複数の行と、文字認識部３０４によって認識された複数の文字（文字コード）とに基づいて、手書き文書６１内の構成を示すグループを認識する。このグループは、例えば、段落、箇条書き、見出し、表、数式等の１つのまとまりとして扱われるべき文字群が属するグループである。 More specifically, the character group recognition unit 305 recognizes groups that indicate the structure of the handwritten document 61, based on the lines recognized by the line recognition unit 303 and the characters (character codes) recognized by the character recognition unit 304. Such a group is one to which a set of characters that should be treated as a single unit belongs, such as a paragraph, a bulleted list, a heading, a table, or a mathematical formula.

例えば、文字グループ認識部３０５は、認識された複数の行の内の第１行と、当該第１行に後続する第２行とが第１条件を満たす場合に、それら第１行と第２行とが一つの段落のグループに含まれることを認識し、また、第１行と第２行とが第１条件を満たさない場合に、それら第１行と第２行とが一つの段落のグループに含まれないこと（別々の段落グループであること）を認識する。 For example, the character group recognizing unit 305 determines that the first line and the second line when the first line of the recognized lines and the second line subsequent to the first line satisfy the first condition. Line is included in a group of paragraphs, and when the first line and the second line do not satisfy the first condition, the first line and the second line are included in one paragraph. Recognize that it is not in a group (a separate paragraph group).

この第１条件は、例えば、手書き文書６１において第１行の水平方向の位置と第２行の水平方向の位置とが揃っていること、である。文字グループ認識部３０５は、例えば、手書き文書６１における、第１行（例えば、第１行の先頭の文字）の水平方向の位置（Ｘ座標）と、第２行（例えば、第２行の先頭の文字）の水平方向の位置（Ｘ座標）との差がしきい値未満である場合に、第１行と第２行とが一つの段落グループに含まれることを認識する。そして、文字グループ認識部３０５は、第１行の水平方向の位置（Ｘ座標）と、第２行の水平方向の位置（Ｘ座標）との差がしきい値以上である場合に、第１行と第２行とが一つの段落グループに含まれないことを認識する。 The first condition is, for example, that the horizontal position of the first line and the horizontal position of the second line are aligned in the handwritten document 61. For example, when the difference between the horizontal position (X coordinate) of the first line (for example, of the first character of the first line) and the horizontal position (X coordinate) of the second line (for example, of the first character of the second line) in the handwritten document 61 is less than a threshold value, the character group recognition unit 305 recognizes that the first line and the second line are included in one paragraph group. When the difference between the horizontal position (X coordinate) of the first line and the horizontal position (X coordinate) of the second line is greater than or equal to the threshold value, the character group recognition unit 305 recognizes that the first line and the second line are not included in one paragraph group.
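
The alignment test for paragraph groups can be sketched as follows, assuming that each line is represented only by the X coordinate of its first character. The function name and the coordinate values in the usage note are hypothetical and are not taken from the figures.

```python
def group_paragraphs(line_start_xs, x_threshold):
    """Return lists of line indices; consecutive aligned lines share a group.

    Two consecutive lines are treated as aligned when the absolute
    difference of their starting X coordinates is below x_threshold.
    """
    groups = []
    for i, x in enumerate(line_start_xs):
        if groups and abs(x - line_start_xs[i - 1]) < x_threshold:
            groups[-1].append(i)
        else:
            groups.append([i])
    return groups
```

With made-up starting coordinates such as `[300, 20, 60, 60, 20, 22, 21, 20]` and a threshold of 10, the eight lines fall into four groups: two single-line groups, one two-line group, and one four-line group.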

文字グループ認識部３０５および整形文書生成部３０６は、さらに、箇条書きを含む手書き文書を整形文書に変換することもできる。文字グループ認識部３０５および整形文書生成部３０６は、例えば、手書き文書内の第１行に対応する複数の第１文字コードの内の先頭の文字コードと、第１行に後続する第２行に対応する複数の第２文字コードの内の先頭の文字コードとが、特定の文字コード（第３文字コード）である場合、複数の第１文字コードと複数の第２文字コードとを用いて、複数の第１文字コードに対応する複数の文字を第１行に対応する位置に含み、複数の第２文字コードに対応する複数の文字を第２行に対応する位置に含む第２整形文書データを生成する。また、文字グループ認識部３０５および整形文書生成部３０６は、複数の第１文字コードの内の先頭の文字コードと、複数の第２文字コードの内の先頭の文字コードとが、第３文字コードでない場合、複数の第１文字コードと複数の第２文字コードとを用いて、複数の第１文字コードの内の少なくとも１つの文字コードに対応する文字を第２行に対応する位置に含むか、または複数の第２文字コードの内の少なくとも１つの文字コードに対応する文字を第１行に対応する位置に含む第３整形文書データを生成する。 The character group recognition unit 305 and the formatted document generation unit 306 can also convert a handwritten document that includes a bulleted list into a formatted document. For example, when the leading character code among the plurality of first character codes corresponding to a first line in the handwritten document and the leading character code among the plurality of second character codes corresponding to a second line following the first line are both a specific character code (third character code), the two units use the first character codes and the second character codes to generate second formatted document data that includes the characters corresponding to the first character codes at a position corresponding to the first line and the characters corresponding to the second character codes at a position corresponding to the second line. When the leading character code among the first character codes and the leading character code among the second character codes are not the third character code, the two units use the first character codes and the second character codes to generate third formatted document data that includes a character corresponding to at least one of the first character codes at a position corresponding to the second line, or a character corresponding to at least one of the second character codes at a position corresponding to the first line.

より具体的には、文字グループ認識部３０５は、認識された段落グループが、箇条書きを構成する複数の行を含むグループであることをさらに認識する。文字グループ認識部３０５は、例えば、手書き文書６１において、複数の行の内の第１行の先頭の文字の文字コードと、この第１行に後続する第２行の先頭の文字の文字コードとが特定の文字コード（第３文字コード）である場合に、それら第１行と第２行とが１つの箇条書きのグループに含まれることを認識する。また、文字グループ認識部３０５は、第１行の先頭の文字の文字コードと第２行の先頭の文字の文字コードとが特定の文字コードでない場合に、それら第１行と第２行とが１つの箇条書きのグループに含まれないことを認識する。この特定の文字は、例えば、「・」、「□」、「〇」のような、箇条書きで用いられることが規定された記号や文字に対応する文字コードである。 More specifically, the character group recognizing unit 305 further recognizes that the recognized paragraph group is a group including a plurality of lines constituting the itemized list. For example, in the handwritten document 61, the character group recognition unit 305 includes a character code of the first character of the first line of the plurality of lines, and a character code of the first character of the second line following the first line. Is a specific character code (third character code), it is recognized that the first line and the second line are included in one bulleted group. Also, the character group recognition unit 305 determines whether the first line and the second line are different when the character code of the first character in the first line and the character code of the first character in the second line are not specific character codes. Recognize that it is not included in a single bulleted group. This specific character is, for example, a character code corresponding to a symbol or character that is specified to be used in a bulleted list, such as “•”, “□”, and “◯”.
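
The bulleted-list test reduces to checking the leading character of each line. The sketch below assumes that recognized lines are available as plain strings; the set `BULLET_MARKS` contains only the three marks named in the text, and the function name is hypothetical.

```python
# Marks named in the text as examples of bullet characters.
BULLET_MARKS = {"・", "□", "〇"}


def is_bullet_group(lines):
    """True when there are at least two lines and each starts with a bullet mark."""
    return len(lines) >= 2 and all(line[:1] in BULLET_MARKS for line in lines)
```

A real implementation would presumably compare character codes rather than strings and might use a larger, configurable set of marks.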

図１０は、手書き文書６１において認識されたグループ６７１〜６７４を示す。 FIG. 10 shows groups 671 to 674 recognized in the handwritten document 61.
文字グループ認識部３０５は、図８に示した行６６１に対応する文字列「２０１２０３／２８」の先頭の文字「２」のＸ座標と、行６６２に対応する文字列「まとめ」の先頭の文字「ま」のＸ座標との差（差の絶対値）がしきい値以上であるので、行６６１と行６６２とを別々の段落グループ６７１，６７２として認識している。また、文字グループ認識部３０５は、図８に示した行６６５に対応する文字列「今日の…」、行６６６に対応する文字列「決まった…」、行６６７に対応する文字列「の準備…」、および行６６８に対応する文字列「さい。」の先頭の文字同士のＸ座標の差（差の絶対値）がしきい値未満であるので、それら行６６５，６６６，６６７，６６８を一つの段落グループ６７４として認識している。これにより、文字グループ認識部３０５は、手書き文書６１内の段落に対応する段落グループ６７４を認識することができる。 Because the difference (absolute value of the difference) between the X coordinate of the first character “2” of the character string “2012 03/28” corresponding to the line 661 shown in FIG. 8 and the X coordinate of the first character “ま” of the character string “まとめ” (“Summary”) corresponding to the line 662 is greater than or equal to the threshold value, the character group recognition unit 305 recognizes the line 661 and the line 662 as separate paragraph groups 671 and 672. Further, because the differences (absolute values of the differences) between the X coordinates of the first characters of the character string “今日の…” corresponding to the line 665, the character string “決まった…” corresponding to the line 666, the character string “の準備…” corresponding to the line 667, and the character string “さい。” corresponding to the line 668 shown in FIG. 8 are less than the threshold value, the character group recognition unit 305 recognizes the lines 665, 666, 667, and 668 as one paragraph group 674. In this way, the character group recognition unit 305 can recognize the paragraph group 674 corresponding to a paragraph in the handwritten document 61.

さらに、文字グループ認識部３０５は、図８に示した行６６３の先頭の文字の文字コード「□」と、行６６４の先頭の文字の文字コード「□」とが特定の文字コードであるので、それら行６６３，６６４を一つの箇条書きのグループ６７３として認識している。これにより、文字グループ認識部３０５は、手書き文書６１内の箇条書きに対応するグループ６７３を認識することができる。 Furthermore, because the character code “□” of the first character of the line 663 and the character code “□” of the first character of the line 664 shown in FIG. 8 are the specific character code, the character group recognition unit 305 recognizes the lines 663 and 664 as one bulleted-list group 673. In this way, the character group recognition unit 305 can recognize the group 673 corresponding to the bulleted list in the handwritten document 61.

整形文書生成部３０６は、文字グループ認識部３０５によって認識されたグループに基づいて、文字認識部３０４によって認識された複数の文字（文字コード）が配置された整形文書データを生成する。整形文書生成部３０６は、認識された文字コードが、認識されたグループ６７１〜６７４の手書き文書６１上での位置に基づいて整形文書６３上に配置された整形文書データを生成する。整形文書生成部３０６は、この整形文書データにおいて、例えば、段落グループ内の複数の行の行間に改行コードを挿入せず、箇条書きグループ内の複数の行の行間に改行コードを挿入し、認識されたグループ間に改行コードを挿入する。 Based on the groups recognized by the character group recognition unit 305, the formatted document generation unit 306 generates formatted document data in which the characters (character codes) recognized by the character recognition unit 304 are arranged. The formatted document generation unit 306 generates formatted document data in which the recognized character codes are arranged on the formatted document 63 based on the positions of the recognized groups 671 to 674 on the handwritten document 61. In this formatted document data, the formatted document generation unit 306, for example, does not insert a line feed code between the lines within a paragraph group, inserts a line feed code between the lines within a bulleted-list group, and inserts a line feed code between the recognized groups.
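
The line feed rules in the preceding paragraph can be illustrated with a small sketch. The `(kind, lines)` tuple representation and the function name `render` are hypothetical; positions, indentation, and font sizes are ignored here so that only the placement of line feed codes is shown.

```python
def render(groups):
    """Join recognized text according to group kind.

    groups: list of (kind, lines) pairs, where kind is "paragraph" or
    "bullets" and lines is a list of recognized strings (hypothetical format).
    """
    parts = []
    for kind, lines in groups:
        # Bullet items keep their line breaks; paragraph lines are merged.
        joiner = "\n" if kind == "bullets" else ""
        parts.append(joiner.join(lines))
    # A line feed separates one group from the next.
    return "\n".join(parts)
```

For example, a paragraph handwritten over two lines comes out as one unbroken sentence, while each item of a bulleted list stays on its own line.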

また、整形文書生成部３０６は、整形文書データに含まれる文字コードに、文字認識部３０４によって算出されたフォントサイズを関連付ける。例えば、整形文書生成部３０６は、生成される整形文書データにおいて、認識された複数の行の内の第１行に対応する複数の第１文字コードに第１フォントサイズを関連付け、第２行に対応する複数の第２文字コードに第２フォントサイズを関連付ける。これにより、整形文書６３において、手書き文書６１上での文字の大きさに基づくフォントサイズで、対応する文字コードを表示することができる。 The formatted document generation unit 306 associates the font size calculated by the character recognition unit 304 with the character code included in the formatted document data. For example, the formatted document generation unit 306 associates the first font size with the plurality of first character codes corresponding to the first line among the plurality of recognized lines in the generated formatted document data, and sets the second line to the second line. The second font size is associated with the corresponding plurality of second character codes. Thereby, in the formatted document 63, the corresponding character code can be displayed with the font size based on the size of the character on the handwritten document 61.

図７に示したように、整形文書生成部３０６は、認識されたグループ６７１〜６７４の手書き文書６１上での位置に基づいて、認識された文字コードを整形文書６３上に配置する。整形文書生成部３０６は、手書き文書６１上の段落グループ６７１の位置に対応する整形文書６３上の位置に、「２０１２０３／２８」という文字コード列を配置している。整形文書生成部３０６は、手書き文書６１上の段落グループ６７２の位置に対応する整形文書６３上の位置に、「まとめ」という文字コード列を配置している。また、整形文書生成部３０６は、手書き文書６１上のグループ６７３の位置に対応する整形文書６３上の位置に、「□」で示された２つの項目を含む箇条書きに対応する文字コード列を配置している。その際、整形文書生成部３０６は、箇条書きグループ６７３に含まれる２つの行の間に改行コードを挿入し、それら２つの行の先頭の文字コードの位置が揃うように（すなわち、２つの項目の先頭のインデントが維持されるように）、各行の先頭に同数の空白（空白文字の文字コード）を挿入している。 As illustrated in FIG. 7, the formatted document generation unit 306 arranges the recognized character codes on the formatted document 63 based on the positions of the recognized groups 671 to 674 on the handwritten document 61. The formatted document generation unit 306 arranges a character code string “2012 03/28” at a position on the formatted document 63 corresponding to the position of the paragraph group 671 on the handwritten document 61. The formatted document generation unit 306 arranges a character code string “Summary” at a position on the formatted document 63 corresponding to the position of the paragraph group 672 on the handwritten document 61. In addition, the formatted document generation unit 306 adds a character code string corresponding to the itemized list including the two items indicated by “□” at the position on the formatted document 63 corresponding to the position of the group 673 on the handwritten document 61. It is arranged. At that time, the formatted document generation unit 306 inserts a line feed code between two lines included in the bulleted group 673 so that the positions of the first character codes of the two lines are aligned (that is, two items). The same number of spaces (blank character code) is inserted at the beginning of each line (so that the indent at the beginning of the line is maintained).

さらに、整形文書生成部３０６は、手書き文書６１上の段落グループ６７４の位置に対応する整形文書６３上の位置に、「今日の打合せで……下さい。」という文字コード列を配置している。整形文書生成部３０６は、手書き文書６１上での４行の文字列に対応する４つの文字コード列が１つの段落グループ６７４に含まれるので、それら４つの文字コード列の間に改行コードを挿入していない。これにより、整形文書６３では、「今日の打合せで……下さい。」という一文が途切れないように、対応する文字コードを配置することができる。 Further, the formatted document generation unit 306 arranges the character code string “In today's meeting ... please.” at the position on the formatted document 63 corresponding to the position of the paragraph group 674 on the handwritten document 61. Because the four character code strings corresponding to the four lines of character strings on the handwritten document 61 are included in one paragraph group 674, the formatted document generation unit 306 does not insert a line feed code between those four character code strings. As a result, in the formatted document 63, the corresponding character codes can be arranged so that the sentence “In today's meeting ... please.” is not interrupted.

ページ保存処理部３０７は、生成された整形文書データを記憶媒体４０２に保存する。 The page storage processing unit 307 stores the generated formatted document data in the storage medium 402.

ページ取得処理部３０８は、記憶媒体４０２から既に格納されている任意の整形文書データを読み出す。読み出された整形文書データは文書表示処理部３０９に送られる。文書表示処理部３０９は、整形文書データを解析し、この解析結果に基づいて、関連付けられたフォントサイズで、文字コードによって示される文字が配置された整形文書（整形ページ）を画面上に表示する。 The page acquisition processing unit 308 reads arbitrary formatted document data already stored in the storage medium 402. The read formatted document data is sent to the document display processing unit 309. The document display processing unit 309 analyzes the formatted document data and, based on the analysis result, displays on the screen a formatted document (formatted page) in which the characters indicated by the character codes are arranged in the associated font sizes.

次いで、図１１、図１２及び図１３を参照して、手書き文書が手書きの表を含む場合について説明する。 Next, a case where the handwritten document includes a handwritten table will be described with reference to FIGS. 11, 12, and 13.

図１１は、手書きの表７１１を含む手書き文書７１が文字認識される例を示す。この手書きの表７１１では、表を明示する縦線および横線は手書きされていないが、複数の項目が垂直方向および水平方向に揃えて配置されることによって、４行×４列の表が示されている。 FIG. 11 shows an example in which a handwritten document 71 including a handwritten table 711 is character-recognized. In this handwritten table 711, vertical lines and horizontal lines that clearly indicate the table are not handwritten, but a plurality of items are arranged in the vertical direction and the horizontal direction to display a table of 4 rows × 4 columns. ing.

文字認識のみでは、手書き文書７１内の文字は、表７１１のようなグループが考慮されることなく文字コードに変換され、その文字コードが文字認識結果７２として出力される。この文字認識結果７２では、認識された文字が手書き文書７１上での行毎に配置されている。 With character recognition alone, the characters in the handwritten document 71 are converted into character codes without taking into account groups such as the table 711, and the character codes are output as a character recognition result 72. In the character recognition result 72, the recognized characters are arranged line by line as they appear on the handwritten document 71.

例えば、手書き文書７１上では、「６月」、「７月」および「８月」という文字列は、表７１１内の３つの列にそれぞれ手書きされている。しかし、文字認識結果７２上では、表７１１内の列が考慮されることなく、３つの列内の文字列が「６月７月８月」という連続した文字列として表示されている。 For example, on the handwritten document 71, the character strings “June”, “July”, and “August” are respectively handwritten in three columns in the table 711. However, on the character recognition result 72, the columns in the table 711 are not considered, and the character strings in the three columns are displayed as continuous character strings “June, July, August”.

図６に示した例と同様に、手書き文書７１の文字認識だけでは、認識された文字（文字コード）が単に並べられるだけであるので、例えば、表７１１内の項目が連結された状態で表示されてしまう。つまり、表７１１の構成に関する情報が失われてしまう。 As in the example shown in FIG. 6, character recognition of the handwritten document 71 alone merely lines up the recognized characters (character codes), so that, for example, the items in the table 711 are displayed concatenated. That is, information about the structure of the table 711 is lost.

Therefore, the character group recognition unit 305 recognizes the group of the table 711 included in the handwritten document 71. The following description assumes that the lines and characters in the handwritten document 71 have already been recognized by the line recognition unit 303 and the character recognition unit 304.

The character group recognition unit 305 detects blanks between handwritten character strings in each of the lines recognized by the line recognition unit 303. For example, in the line of the handwritten document 71 that contains "June", "July", and "August", the character group recognition unit 305 detects the blank between "June" and "July" and the blank between "July" and "August". The character group recognition unit 305 then determines whether the detected blanks are at similar horizontal positions across a plurality of lines. For example, when the horizontal position of the blank between "June" and "July" (first blank) and the horizontal position of the blank between "4 pieces" and "10 pieces" (second blank) fall within a predetermined range (first range), the character group recognition unit 305 recognizes that "June" and "4 pieces" are in a first column and that "July" and "10 pieces" are in a second column. More specifically, the character group recognition unit 305 makes this recognition when the absolute value of the difference between the X coordinate of the left end of the first blank and the X coordinate of the left end of the second blank is within a threshold value, and the absolute value of the difference between the X coordinate of the right end of the first blank and the X coordinate of the right end of the second blank is also within the threshold value.
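The left-edge and right-edge comparison described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Blank` type, the function names, and the sample coordinates are all hypothetical, and "within a threshold value" is assumed to mean less than or equal to the threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Blank:
    """Horizontal gap between two handwritten strings on one line (hypothetical type)."""
    left: float   # X coordinate of the blank's left end
    right: float  # X coordinate of the blank's right end


def blanks_aligned(first: Blank, second: Blank, threshold: float) -> bool:
    """True when both the left ends and the right ends differ by at most `threshold`."""
    return (abs(first.left - second.left) <= threshold
            and abs(first.right - second.right) <= threshold)


def shared_column_boundaries(line_a, line_b, threshold):
    """Pair up blanks from two lines that sit at similar horizontal positions."""
    return [(a, b) for a in line_a for b in line_b
            if blanks_aligned(a, b, threshold)]


# Made-up coordinates: two blanks on a header line and two on a data line.
header_blanks = [Blank(120, 160), Blank(240, 280)]
row_blanks = [Blank(118, 163), Blank(236, 284)]
pairs = shared_column_boundaries(header_blanks, row_blanks, threshold=10)
```

Each pair in `pairs` marks a blank that is aligned across the two lines, so the strings on either side of it can be assigned to adjacent columns.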

Accordingly, as shown in FIG. 12, in the table 711 in the handwritten document 71, the positions of the blanks 73S in each line fall within a predetermined range across a plurality of lines, so the columns 73A, 73B, 73C, and 73D in the table 711 are recognized. The character group recognition unit 305 can thereby recognize the table group 73 in the handwritten document 71.

The formatted document generation unit 306 generates formatted document data in which the character codes recognized from the handwritten characters in the table 711 are arranged at the position on a formatted document (formatted page) 74 that corresponds to the position of the table group 73 on the handwritten document 71. Based on the positions of the columns 73A, 73B, 73C, and 73D in the table group 73, the formatted document generation unit 306 arranges, for each column, the character codes corresponding to the characters (character strings) in the table 711 at the same horizontal position (that is, with the left ends of the items in each column aligned).

As shown in FIG. 13, in the formatted document 74, the items in the handwritten table 711 are arranged with the columns taken into account. For example, the items "June", "4 pieces", "6 pieces", and "11 pieces", which belong to the same column 73B in the table 711, are arranged with their beginnings aligned. The formatted document 74 can thus be generated from the handwritten document 71 without losing the information on the structure of the handwritten table 711.
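As a rough sketch of aligning the left ends of the items in each column, the function below snaps every item to a common left edge per column. The function name, the data layout, and the choice of the smallest handwritten X coordinate as the common edge are assumptions made for illustration; the source only states that the left ends are aligned.

```python
def align_columns(rows):
    """Snap each item to the common left edge of its column.

    `rows` is a list of lines; each line is a list of (left_x, text) tuples,
    and item i of every line is assumed to belong to column i.
    """
    n_cols = max(len(row) for row in rows)
    # Assumption: use the smallest handwritten left_x in a column as its edge.
    col_left = [min(row[i][0] for row in rows if i < len(row))
                for i in range(n_cols)]
    return [[(col_left[i], text) for i, (_, text) in enumerate(row)]
            for row in rows]


# Made-up handwritten positions for two lines of a table.
rows = [
    [(12.0, "June"), (110.0, "July")],
    [(10.0, "4 pieces"), (114.0, "10 pieces")],
]
aligned = align_columns(rows)
```

After the call, "June" and "4 pieces" share one left edge and "July" and "10 pieces" share another, which mirrors the column-wise arrangement shown for the formatted document 74.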

In the present embodiment, a handwritten document that includes mathematical expressions or program source code may also be converted into a formatted document. For mathematical expressions and program source code, changing the line structure used on the handwritten document may change the content (interpretation) of the expression or program. It is therefore desirable to preserve the lines of the handwritten document for mathematical expressions and program source code.

For example, when a plurality of first character codes corresponding to a first line among the plurality of lines included in the handwritten document include the character code of a predetermined mathematical symbol, the character group recognition unit 305 recognizes the first line as one mathematical expression group. The formatted document generation unit 306 generates formatted document data in which the plurality of character codes corresponding to the mathematical expression group are arranged at the position on the formatted document that corresponds to the position of the group on the handwritten document. Thus, for example, when the plurality of first character codes include the character code of a predetermined mathematical symbol, the formatted document generation unit 306 generates formatted document data (second formatted document data) that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line, and includes the plurality of characters corresponding to a plurality of second character codes, which correspond to a second line following the first line, at the position corresponding to the second line. In this formatted document data, the formatted document generation unit 306 inserts, for example, a line feed code between the plurality of first character codes corresponding to the first line and the plurality of second character codes corresponding to the second line.
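The mathematical-symbol check and the resulting line feed can be sketched as below. The symbol set `MATH_SYMBOLS` and both function names are hypothetical; the source does not list which symbols count as "predetermined". The non-formula branch simply concatenates the two lines as a stand-in for whatever merging the other group types would apply.

```python
# Hypothetical set of "predetermined mathematical symbols".
MATH_SYMBOLS = frozenset("=+-×÷√∑∫≤≥")


def is_formula_line(chars: str) -> bool:
    """True when the recognized characters contain any predetermined symbol."""
    return any(c in MATH_SYMBOLS for c in chars)


def join_lines(first: str, second: str) -> str:
    """Keep the line break after `first` when it is recognized as a formula."""
    if is_formula_line(first):
        return first + "\n" + second  # line feed code preserves the line
    return first + second             # placeholder for non-formula handling


formula_text = join_lines("y=2x+1", "z=3")
plain_text = join_lines("hello ", "world")
```

A real symbol set would need care, since characters such as the hyphen also occur in ordinary text.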

The character group recognition unit 305 also determines, for example, whether the plurality of character codes corresponding to a first line among the plurality of lines included in the handwritten document are program source code, using description specification data that indicates the specifications for writing in a predetermined programming language. The description specification data defines, for example, the symbols (character codes of symbols) used to write source code in the predetermined programming language and character strings (for example, character codes corresponding to names used as classes, methods, data types, functions, and the like). The description specification data is stored in advance in, for example, the storage medium 402. When the plurality of character codes are program source code, the character group recognition unit 305 recognizes the first line as one source code group. The formatted document generation unit 306 generates formatted document data in which the plurality of character codes corresponding to the source code group are arranged at the position on the formatted document that corresponds to the position of the group on the handwritten document. Thus, for example, when the plurality of first character codes are program source code, the formatted document generation unit 306 generates formatted document data (second formatted document data) that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line, and includes the plurality of characters corresponding to a plurality of second character codes, which correspond to a second line following the first line, at the position corresponding to the second line. In this formatted document data, the formatted document generation unit 306 inserts, for example, a line feed code between the plurality of first character codes corresponding to the first line and the plurality of second character codes corresponding to the second line.

As described above, for a handwritten document that includes mathematical expressions or program source code, formatted document data can be generated in which the lines of the expressions or source code on the handwritten document are preserved.
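One way to picture the description specification data is as a small table of symbols and reserved names that a recognized line is checked against. The sketch below is a heuristic under stated assumptions: the `SPEC` contents, the `min_hits` scoring rule, and the function name are all invented for illustration and are not given in the source.

```python
import re

# Hypothetical description specification data for a C-like language.
SPEC = {
    "symbols": {"{", "}", ";", "(", ")", "=="},
    "keywords": {"int", "void", "class", "return", "if", "for"},
}


def looks_like_source_code(line: str, spec=SPEC, min_hits: int = 2) -> bool:
    """Count keyword and symbol matches; `min_hits` is an assumed cutoff."""
    words = set(re.findall(r"[A-Za-z_]\w*", line))
    hits = len(words & spec["keywords"])
    hits += sum(1 for symbol in spec["symbols"] if symbol in line)
    return hits >= min_hits


code_result = looks_like_source_code("int x = f(y);")
prose_result = looks_like_source_code("Buy milk and eggs")
```

A simple count like this can misfire on ordinary sentences that happen to contain parentheses and a word such as "for", so a practical check would likely need a stricter rule.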

The above description covers processing of a horizontally written document, in which characters are handwritten in the horizontal direction, but the same processing can also be applied to a vertically written document, in which characters are handwritten in the vertical direction. In that case, the line recognition unit 303 recognizes vertically written lines from the handwritten document. The character recognition unit 304 converts the handwritten characters included in the recognized vertically written lines into character codes. The character group recognition unit 305 and the formatted document generation unit 306 then use the recognized vertically written lines and the character codes to recognize groups indicating the structure of the vertically written handwritten document, and generate formatted document data (vertically written formatted document data) that includes the character codes arranged based on the recognized groups.

Furthermore, the above description covers an example in which handwritten document data including time-series information (stroke data) is converted into formatted document data, but image data of a page may instead be generated by scanning characters printed or handwritten on a paper page, and this image data may be converted into formatted document data. The line recognition unit 303 uses the image data to recognize a plurality of lines on the image. The character recognition unit 304 uses the image data to convert the handwritten characters included in the recognized lines into character codes. The character group recognition unit 305 and the formatted document generation unit 306 then recognize groups indicating the structure within the image (handwritten document) and generate formatted document data that includes the character codes arranged based on the recognized groups.

Next, an example of the procedure of the handwriting input processing executed by the digital notebook application 202 will be described with reference to FIG. 14.

First, the trajectory display processing unit 301 displays, on the display 17A, the trajectory (stroke) of the movement of the pen 100 or the like made by a handwriting input operation (block B11). The time-series information generation unit 302 generates the above-described time-series information (a plurality of stroke data arranged in chronological order) based on the coordinate sequence corresponding to the trajectory made by the handwriting input operation (block B12). The time-series information generation unit 302 may temporarily store the time-series information in the work memory 401. The page storage processing unit 307 may store the time-series information generated by the time-series information generation unit 302 (the time-series information temporarily stored in the work memory 401) in a storage medium as handwritten document data.

FIG. 15 shows an example of the procedure of the handwritten document conversion processing executed by the digital notebook application 202.

First, the line recognition unit 303 uses the generated time-series information (handwritten document data) to recognize a plurality of lines from the plurality of strokes (block B21). The character recognition unit 304 recognizes a plurality of characters from the plurality of strokes (block B22). For example, the character recognition unit 304 converts each handwritten character into a character code. Through this character recognition, the plurality of strokes are divided into blocks, one for each character.

Next, the character group recognition unit 305 recognizes groups of characters, such as paragraphs, bulleted lists, headings, tables, and mathematical expressions, based on the recognized lines and characters (block B23). The formatted document generation unit 306 then generates a formatted page in which the characters (character codes) are arranged based on the recognized groups (block B24).
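The order of blocks B21 to B24 can be summarized as a small pipeline. This is only a structural sketch: the function name and the four callables passed in are hypothetical placeholders for the recognition and generation units, and the stub lambdas exist solely to make the example runnable.

```python
def convert_handwritten_document(strokes, recognize_lines, recognize_characters,
                                 recognize_groups, generate_formatted_page):
    """Run the four conversion steps in the order of blocks B21 to B24."""
    lines = recognize_lines(strokes)              # block B21: strokes -> lines
    char_codes = recognize_characters(strokes)    # block B22: strokes -> character codes
    groups = recognize_groups(lines, char_codes)  # block B23: paragraphs, tables, ...
    return generate_formatted_page(groups)        # block B24: formatted page


# Stub stages standing in for the real recognition units.
page = convert_handwritten_document(
    strokes=["s1", "s2"],
    recognize_lines=lambda strokes: [strokes],
    recognize_characters=lambda strokes: ["A", "B"],
    recognize_groups=lambda lines, codes: [("paragraph", codes)],
    generate_formatted_page=lambda groups: "".join(groups[0][1]),
)
```

Passing the stages in as callables keeps the ordering explicit while leaving each recognition step free to be implemented separately.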

As described above, according to the present embodiment, a handwritten document including handwritten characters can be converted into a formatted document including character codes. The line recognition unit 303 uses the handwritten document data to recognize a plurality of lines on the handwritten document 61, each of which includes a plurality of handwritten characters. The character recognition unit 304 converts the plurality of handwritten characters included in each of the recognized lines into a plurality of character codes. The character group recognition unit 305 and the formatted document generation unit 306 then recognize groups indicating the structure within the handwritten document 61 and generate formatted document data that includes the character codes arranged based on the recognized groups.

As a result, a handwritten document containing characters handwritten on a device such as a digital notebook can be converted (fair-copied) into a document file (formatted document data) without losing the structure expressed by the handwriting (such as character sizes and column positions) and without requiring any work from the user.

Note that all of the processing procedures of the present embodiment described with reference to the flowcharts of FIGS. 14 and 15 can be executed by software. Therefore, the same effects as those of the present embodiment can easily be obtained simply by installing a program that executes these processing procedures on an ordinary computer through a computer-readable storage medium storing the program, and executing the program.

Although several embodiments of the present invention have been described, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are included in the invention described in the claims and its equivalents.

17A ... LCD, 202 ... digital notebook application, 301 ... trajectory display processing unit, 302 ... time-series information generation unit, 303 ... line recognition unit, 304 ... character recognition unit, 305 ... character group recognition unit, 306 ... formatted document generation unit, 307 ... page storage processing unit, 308 ... page acquisition processing unit, 309 ... document display processing unit, 401 ... work memory, 402 ... storage medium

Claims

An electronic device comprising:
acquisition means capable of acquiring, using data of a handwritten document including a plurality of handwritten characters corresponding to a plurality of lines, character codes of the plurality of handwritten characters corresponding to the plurality of lines; and
display control means capable of displaying, when a first condition is satisfied, first formatted document data that, using a plurality of first character codes corresponding to a first line and a plurality of second character codes corresponding to a second line, includes a character corresponding to at least one first character code at a position corresponding to the second line or includes a character corresponding to at least one second character code at a position corresponding to the first line, and capable of displaying, when the first condition is not satisfied, second formatted document data that, using the plurality of first character codes and the plurality of second character codes, includes a plurality of characters corresponding to the plurality of first character codes at a position corresponding to the first line and includes a plurality of characters corresponding to the plurality of second character codes at a position corresponding to the second line,
wherein whether or not the first condition is satisfied is determined using at least one of: (1) whether or not the character code corresponding to the leading character of the first line matches the character code corresponding to the leading character of the second line; (2) whether or not the character code corresponding to the leading character of the first line and the character code corresponding to the leading character of the second line correspond to a symbol used for bullet points; (3) the relationship between the horizontal position of the first line and the horizontal position of the second line in the case of horizontal writing; (4) the vertical position of the first line in the case of vertical writing; (5) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to source code; and (6) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to a mathematical expression.

The electronic device according to claim 1, wherein the display control means
does not insert a line feed code between the plurality of first character codes and the plurality of second character codes when the difference between the horizontal position of the first line and the horizontal position of the second line is less than a threshold value, and
inserts a line feed code between the plurality of first character codes and the plurality of second character codes when the difference between the horizontal position of the first line and the horizontal position of the second line is equal to or greater than the threshold value.

The electronic device according to claim 1, wherein the display control means
inserts a line feed code between the plurality of first character codes and the plurality of second character codes when the leading character code of the plurality of first character codes and the leading character code of the plurality of second character codes are a third character code, and
does not insert a line feed code between the plurality of first character codes and the plurality of second character codes when the leading character code of the plurality of first character codes and the leading character code of the plurality of second character codes are not the third character code.

The electronic device according to claim 1, wherein the display control means is further capable of displaying, when the horizontal position of a first blank in the first line and the horizontal position of a second blank in the second line are within a first range, the second formatted document data in which the character code of the character that follows the first blank among the plurality of first character codes and the character code of the character that follows the second blank among the plurality of second character codes are arranged at the same horizontal position.

The electronic device according to claim 1, wherein the acquisition means further determines a first font size based on the sizes of a plurality of first handwritten characters in the first line and determines a second font size based on the sizes of a plurality of second handwritten characters in the second line, and
the first font size is associated with the plurality of first character codes and the second font size is associated with the plurality of second character codes.

The electronic device according to claim 1, wherein the display control means is further capable of displaying, when the plurality of first character codes include a character code of a predetermined mathematical symbol, the second formatted document data that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line and includes the plurality of characters corresponding to the plurality of second character codes at the position corresponding to the second line.

The electronic device according to claim 1, wherein the display control means further determines whether or not the plurality of first character codes are program source code using data indicating specifications for writing in a predetermined programming language, and is capable of displaying, when the plurality of first character codes are program source code, the second formatted document data that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line and includes the plurality of characters corresponding to the plurality of second character codes at the position corresponding to the second line.

The electronic device according to claim 1, further comprising a touch screen display,
wherein the data of the handwritten document includes a plurality of stroke data corresponding to a plurality of strokes based on a handwriting input operation using the touch screen display.

A method comprising:
acquiring, using data of a handwritten document including a plurality of handwritten characters corresponding to a plurality of lines, character codes of the plurality of handwritten characters corresponding to the plurality of lines;
displaying, when a first condition is satisfied, first formatted document data that, using a plurality of first character codes corresponding to a first line and a plurality of second character codes corresponding to a second line, includes a character corresponding to at least one first character code at a position corresponding to the second line or includes a character corresponding to at least one second character code at a position corresponding to the first line; and
displaying, when the first condition is not satisfied, second formatted document data that, using the plurality of first character codes and the plurality of second character codes, includes a plurality of characters corresponding to the plurality of first character codes at a position corresponding to the first line and includes a plurality of characters corresponding to the plurality of second character codes at a position corresponding to the second line,
wherein whether or not the first condition is satisfied is determined using at least one of: (1) whether or not the character code corresponding to the leading character of the first line matches the character code corresponding to the leading character of the second line; (2) whether or not the character code corresponding to the leading character of the first line and the character code corresponding to the leading character of the second line correspond to a symbol used for bullet points; (3) the relationship between the horizontal position of the first line and the horizontal position of the second line in the case of horizontal writing; (4) the vertical position of the first line in the case of vertical writing; (5) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to source code; and (6) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to a mathematical expression.

The method according to claim 9, wherein displaying the first formatted document data includes not inserting a line feed code between the plurality of first character codes and the plurality of second character codes when the difference between the horizontal position of the first line and the horizontal position of the second line is less than a threshold value, and
displaying the second formatted document data includes inserting a line feed code between the plurality of first character codes and the plurality of second character codes when the difference between the horizontal position of the first line and the horizontal position of the second line is equal to or greater than the threshold value.

The method according to claim 9, wherein displaying the second formatted document data includes inserting a line feed code between the plurality of first character codes and the plurality of second character codes when the leading character code of the plurality of first character codes and the leading character code of the plurality of second character codes are a third character code, and
displaying the first formatted document data includes not inserting a line feed code between the plurality of first character codes and the plurality of second character codes when the leading character code of the plurality of first character codes and the leading character code of the plurality of second character codes are not the third character code.

The method according to claim 9, wherein displaying the second formatted document data further includes displaying, when the horizontal position of a first blank in the first line and the horizontal position of a second blank in the second line are within a first range, the second formatted document data in which the character code of the character that follows the first blank among the plurality of first character codes and the character code of the character that follows the second blank among the plurality of second character codes are arranged at the same horizontal position.

The method according to claim 9, wherein the acquiring further includes determining a first font size based on the sizes of a plurality of first handwritten characters in the first line and determining a second font size based on the sizes of a plurality of second handwritten characters in the second line, and
the first font size is associated with the plurality of first character codes and the second font size is associated with the plurality of second character codes.

The method according to claim 9, wherein displaying the second formatted document data further includes displaying, when the plurality of first character codes include a character code of a predetermined mathematical symbol, the second formatted document data that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line and includes the plurality of characters corresponding to the plurality of second character codes at the position corresponding to the second line.

The method according to claim 9, wherein displaying the second formatted document data further includes determining whether or not the plurality of first character codes are program source code using data indicating specifications for writing in a predetermined programming language, and displaying, when the plurality of first character codes are program source code, the second formatted document data that includes the plurality of characters corresponding to the plurality of first character codes at the position corresponding to the first line and includes the plurality of characters corresponding to the plurality of second character codes at the position corresponding to the second line.

The method according to claim 9, wherein the data of the handwritten document includes a plurality of stroke data corresponding to a plurality of strokes based on a handwriting input operation using a touch screen display.

A program executed by a computer, the program causing the computer to execute:
a procedure for acquiring, using data of a handwritten document including a plurality of handwritten characters corresponding to a plurality of lines, character codes of the plurality of handwritten characters corresponding to the plurality of lines;
a procedure for displaying, when a first condition is satisfied, first formatted document data that, using a plurality of first character codes corresponding to a first line and a plurality of second character codes corresponding to a second line, includes a character corresponding to at least one first character code at a position corresponding to the second line or includes a character corresponding to at least one second character code at a position corresponding to the first line; and
a procedure for displaying, when the first condition is not satisfied, second formatted document data that, using the plurality of first character codes and the plurality of second character codes, includes a plurality of characters corresponding to the plurality of first character codes at a position corresponding to the first line and includes a plurality of characters corresponding to the plurality of second character codes at a position corresponding to the second line,
wherein whether or not the first condition is satisfied is determined using at least one of: (1) whether or not the character code corresponding to the leading character of the first line matches the character code corresponding to the leading character of the second line; (2) whether or not the character code corresponding to the leading character of the first line and the character code corresponding to the leading character of the second line correspond to a symbol used for bullet points; (3) the relationship between the horizontal position of the first line and the horizontal position of the second line in the case of horizontal writing; (4) the vertical position of the first line in the case of vertical writing; (5) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to source code; and (6) whether or not the plurality of characters corresponding to the plurality of first character codes and the plurality of characters corresponding to the plurality of second character codes correspond to a mathematical expression.

The program according to claim 17, wherein, in the procedure for displaying the first formatted document data, when the difference between the horizontal position of the first line and the horizontal position of the second line is less than a threshold value, no line feed code is inserted between the plurality of first character codes and the plurality of second character codes, and
in the procedure for displaying the second formatted document data, when the difference between the horizontal position of the first line and the horizontal position of the second line is equal to or greater than the threshold value, a line feed code is inserted between the plurality of first character codes and the plurality of second character codes.
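
The threshold rule in this claim can be illustrated with a minimal Python sketch. The function name `join_by_position`, the use of each line's left edge as its horizontal position, and the default threshold of 20 units are assumptions made for illustration; the claim does not fix any of them.

```python
LINE_FEED = "\n"

def join_by_position(first_codes: str, second_codes: str,
                     first_x: float, second_x: float,
                     threshold: float = 20.0) -> str:
    """Join the character codes of two recognized lines.

    first_x and second_x are the horizontal positions of the two lines
    (assumed here to be the left edge of each line, in arbitrary units).
    """
    difference = abs(first_x - second_x)
    if difference < threshold:
        # Positions are close: treat the lines as continuous text,
        # so no line feed code is inserted (first formatted document data).
        return first_codes + second_codes
    # Positions differ by the threshold or more: keep the break
    # (second formatted document data).
    return first_codes + LINE_FEED + second_codes
```

For example, `join_by_position("ab", "cd", 10, 12)` yields one continuous run of text, while a second line that starts 30 units further right is kept on its own line.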

The program according to claim 17, wherein, in the procedure for displaying the second formatted document data, when the first character code of the plurality of first character codes and the first character code of the plurality of second character codes are both a third character code, a line feed code is inserted between the plurality of first character codes and the plurality of second character codes, and
in the procedure for displaying the first formatted document data, when the first character code of the plurality of first character codes and the first character code of the plurality of second character codes are not the third character code, no line feed code is inserted between the plurality of first character codes and the plurality of second character codes.
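
As a rough sketch of this leading-character rule, the following Python function keeps the line break only when both lines begin with the same designated code. The function name `join_by_leading_code` and the choice of `"-"` as the third character code are hypothetical; the claim only requires that some third character code be used.

```python
def join_by_leading_code(first_codes: str, second_codes: str,
                         third_code: str = "-") -> str:
    """Insert a line feed only when both lines start with third_code."""
    both_start_with_third = (
        first_codes[:1] == third_code and second_codes[:1] == third_code
    )
    if both_start_with_third:
        # Both lines open with the designated code (for example a list marker),
        # so they are kept as separate lines.
        return first_codes + "\n" + second_codes
    # Otherwise the second line is treated as a continuation of the first.
    return first_codes + second_codes
```

Slicing with `[:1]` rather than indexing with `[0]` keeps the function safe for empty lines.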

The program according to claim 17, wherein, in the procedure of displaying the second formatted document data, when the horizontal position of a first blank in the first line and the horizontal position of a second blank in the second line are within a first range, second formatted document data is displayed in which the character that follows the first blank among the plurality of first character codes and the character that follows the second blank among the plurality of second character codes are arranged at the same horizontal position.
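
One possible reading of this blank-alignment rule is sketched below in Python. It assumes that "within the first range" means the two blank positions differ by no more than a tolerance, that each line contains a single relevant blank (the first space in its text), and that the result is rendered in a monospaced font. The names `align_after_blank` and `tolerance` are hypothetical.

```python
def align_after_blank(first: str, second: str,
                      first_blank_x: float, second_blank_x: float,
                      tolerance: float = 10.0) -> tuple[str, str]:
    """Pad two lines so the text after their first blank starts in the same column."""
    if abs(first_blank_x - second_blank_x) > tolerance:
        # Handwritten blanks are too far apart: leave both lines unchanged.
        return first, second
    i, j = first.find(" "), second.find(" ")
    if i < 0 or j < 0:
        # One of the lines has no blank to align on.
        return first, second
    column = max(i, j) + 1  # column where the text after the blank begins
    aligned_first = first[:i].ljust(column) + first[i:].lstrip(" ")
    aligned_second = second[:j].ljust(column) + second[j:].lstrip(" ")
    return aligned_first, aligned_second
```

With the inputs `"x = 1"` and `"long = 2"` and blank positions 50 and 55, both returned strings have `=` in the same column.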

The program according to claim 17, wherein the acquiring procedure further includes determining a first font size based on the sizes of a plurality of first handwritten characters in the first line, and determining a second font size based on the sizes of a plurality of second handwritten characters in the second line, and
the first font size is associated with the plurality of first character codes, and the second font size is associated with the plurality of second character codes.
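
A minimal sketch of per-line font size selection follows. The claim states only that the font size is determined from the sizes of the handwritten characters in the line; the use of the median character height, the candidate size list `AVAILABLE_SIZES`, and the function name `font_size_for_line` are all assumptions for illustration.

```python
from statistics import median

# Hypothetical set of font sizes the display side can render.
AVAILABLE_SIZES = (8, 10, 12, 14, 18, 24, 36)

def font_size_for_line(char_heights: list[float]) -> int:
    """Pick the available font size closest to the median handwritten height."""
    target = median(char_heights)
    return min(AVAILABLE_SIZES, key=lambda size: abs(size - target))

def attach_font_size(codes: str, char_heights: list[float]) -> dict:
    """Associate the chosen font size with the line's character codes."""
    return {"codes": codes, "font_size": font_size_for_line(char_heights)}
```

Calling `attach_font_size` once per line gives each line its own font size, so a heading written in large strokes and body text written in small strokes can be rendered at different sizes.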

The program according to claim 17, wherein, in the procedure of displaying the second formatted document data, when the plurality of first character codes include a character code of a predetermined mathematical symbol, second formatted document data is displayed that includes a plurality of characters corresponding to the plurality of first character codes at a position corresponding to the first line and a plurality of characters corresponding to the plurality of second character codes at a position corresponding to the second line.
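
The mathematical-symbol check can be sketched as a simple membership test. The symbol set `MATH_SYMBOLS` below is a hypothetical example; the claim refers only to "a predetermined mathematical symbol" and does not enumerate which symbols qualify.

```python
# Hypothetical set of predetermined mathematical symbols.
MATH_SYMBOLS = frozenset("=+×÷√∑∫")

def contains_math_symbol(first_codes: str) -> bool:
    """Return True if any character code in the line is a listed math symbol."""
    return any(ch in MATH_SYMBOLS for ch in first_codes)

def format_with_math_rule(first_codes: str, second_codes: str) -> str:
    """Keep each line in its own position when the first line looks like a formula."""
    if contains_math_symbol(first_codes):
        return first_codes + "\n" + second_codes
    # No math symbol: this sketch simply joins the lines; other conditions
    # from the independent claim would normally be consulted here.
    return first_codes + second_codes
```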

The program according to claim 17, wherein the procedure of displaying the second formatted document data further includes determining whether or not the plurality of first character codes are program source code, using data indicating specifications relating to the description of a predetermined programming language, and, when the plurality of first character codes are program source code, displaying second formatted document data that includes a plurality of characters corresponding to the plurality of first character codes at a position corresponding to the first line and a plurality of characters corresponding to the plurality of second character codes at a position corresponding to the second line.

The computer further comprises a touch screen display,
The program according to claim 17, wherein the handwritten document data includes a plurality of stroke data corresponding to a plurality of strokes based on a handwriting input operation using the touch screen display.