JP2022537169A

JP2022537169A - Handling handwritten text input in freehand writing mode

Info

Publication number: JP2022537169A
Application number: JP2021574329A
Authority: JP
Inventors: アランシャタイナー
Original assignee: マイスクリプト
Priority date: 2019-06-20
Filing date: 2020-06-19
Publication date: 2022-08-24
Anticipated expiration: 2040-06-19
Also published as: EP3754537A1; KR20220024146A; KR102576276B1; EP3754537B1; US20200401796A1; CN114402331A; JP7486131B2; US11687618B2; WO2020254600A1

Abstract

The present invention relates to a method comprising: detecting strokes of digital ink (IN) input on a computing device in a freehand writing format (FT1); detecting a text block (BL1) from said strokes; performing text recognition on each text line of said text block, including extracting the text lines from the text block (BL1) and generating model data associating each stroke of the text block with a character, a word and a text line of the text block (BL1); and normalizing each text line from the freehand writing format (FT1) into a structured format (FT2) so as to comply with a document pattern (200). The normalization may include, for each text line: computing a transform function to convert said text line into the structured format; applying the transform function to the text line; and updating the model data of said text line based on the transform function. [Selected drawing] FIG. 3

Description

Technical Field

The present disclosure relates generally to the field of computing device interfaces capable of recognizing a user's handwritten input of text. In particular, the present disclosure concerns computing devices and corresponding methods for recognizing and editing handwritten text.

Computing devices continue to become more ubiquitous in daily life. They take various forms: desktop computers, laptops, tablet PCs, hybrid computers (2-in-1s), e-book readers, mobile phones, smartphones, wearable computers (including smartwatches and smart glasses/headsets), global positioning system (GPS) units, enterprise digital assistants (EDAs), personal digital assistants (PDAs), game consoles, and the like. Computing devices are also being incorporated into vehicles and equipment, such as cars, trucks, farm equipment, manufacturing equipment, building environment control (e.g., lighting, HVAC), and home and commercial appliances.

Each type of computing device is equipped with particular computing resources and is intended for given uses. A computing device generally comprises at least one processing element, such as a central processing unit (CPU), some form of memory, and input and output devices. The variety of computing devices and their uses calls for a variety of input devices and interfaces that allow users to interact with them.

One such input device is a touch-sensitive surface, such as a touchscreen or touchpad, in which user input is received through contact between a part of the user's body (e.g., a finger) or an instrument (e.g., a pen or stylus) and the touch-sensitive surface. Another input device is an input surface that senses gestures made by a user above the input surface. A further input device is a position detection system, which detects the relative position of touch or non-touch interactions with a non-touch physical or virtual surface.

Handwriting recognition can be implemented in computing devices to input and process various types of input elements hand-drawn or handwritten by a user, such as text content (e.g., alphanumeric characters) or non-text content (e.g., shapes, drawings). Once input on a computing device, the input elements are usually displayed as digital ink and undergo handwriting recognition to be converted into typeset versions. The user's handwriting input is typically interpreted using a real-time handwriting recognition system or method. To this end, either online systems (recognition carried out using a cloud-based solution or the like) or offline systems may be used.

The user input may be a diagram or any other text content, non-text content, or mixed content of text and non-text. Handwriting input may be made on a structured document according to guidelines (or baselines) that guide or constrain the user's input. Alternatively, a user may handwrite in a freehand writing mode, that is, without any constraint on lines to follow or input size to comply with (e.g., on a blank page).

FIG. 1A shows an example of a computing device 1 comprising a display device 1 which displays ink input elements hand-drawn or handwritten by a user in a freehand writing mode using an appropriate user interface. In the present case, the computing device 1 detects and displays text content 4 and 6 and non-text content 8, 10 and 12. Each of these elements is formed by one or more strokes of digital ink. Input elements may comprise, for instance, handwritten text, diagrams, musical annotations, and so on. In this example, the shape 8 is a rectangle or the like that constitutes a container (a box) containing the text content 6, so that both elements 6 and 8 can be selected and manipulated together.

Handwriting recognition may be performed on text input elements and possibly also on non-text input elements. In addition, each input element may be converted and displayed as a typeset input element, as depicted in this example in FIG. 1B.

In handwriting recognition applications, text recognition performance is not always satisfactory, particularly when handwritten text is input in a freehand writing mode (i.e., in a freehand writing format). Reliability and stability issues in the text recognition process often undermine the performance of such computing devices and thus limit the overall user experience. The limitations stem in particular from the difficulty for a computing device to determine when a new ink stroke affects a previous state of recognition (i.e., the result of an earlier recognition based on previously input ink strokes) and when a new ink stroke does not affect such a previous state (i.e., the new ink stroke relates to new content that has no bearing on previously input content).

Further, it is usually possible to perform some level of editing on user input displayed on a computing device. Conventionally, however, such applications are limited in their capability to handle editing functions, and typically constrain users to adopt behaviors or accept compromises that do not reflect their original intent.

Display and editing functions are particularly limited on computing devices when handwritten text is input using a freehand writing mode. By nature, no constraint on lines, size, orientation, margins and the like is imposed on the user in a freehand writing mode, so handwriting of varied and complex forms may be input. This makes it more difficult for a computing device to implement editing functions for manipulating handwritten text (e.g., moving, rescaling, correcting, or inserting line breaks in a text flow). Yet users may wish to edit and manipulate handwriting input in a more structured and advanced manner, especially when a freehand writing mode is used. These limitations in handling handwritten text input in a freehand writing format degrade the user experience and call for improvement.

There is a need for a solution allowing efficient and reliable processing of handwritten text input in a freehand writing mode (or format), in particular to improve text recognition and to allow efficient editing of such handwritten text on a computing device.

The examples of the present invention described below provide computing devices, methods and corresponding computer programs for editing handwritten text input by a user.

According to a particular aspect, the invention provides a method implemented by a computing device for processing handwritten text, comprising:
detecting, with an input surface, a plurality of input strokes of digital ink, said input strokes being input in a freehand writing format without any handwriting constraint;
displaying said plurality of input strokes on a display device in said freehand writing format;
classifying each input stroke as text or non-text, said classifying comprising detecting as text at least one text block of handwritten text from said input strokes handwritten in the freehand writing format;
performing text recognition on said at least one text block, said text recognition comprising:
  extracting text lines of handwritten text from said at least one text block; and
  generating model data associating each stroke of said at least one text block with a character, a word and a text line of said at least one text block; and
normalizing each text line of handwritten text from the freehand writing format into a structured handwriting format so as to comply with a line pattern of a document pattern, said normalizing comprising, for each text line:
  computing a respective transform function for said text line to transform said text line into the structured handwriting format;
  applying the respective transform function to transform each stroke of said text line into the structured handwriting format; and
  updating the model data of said text line based on the respective transform function.

In a particular embodiment, the method comprises storing the model data generated during said text recognition, wherein said updating of the model data further comprises storing the updated model data of said at least one text block in replacement of the model data generated during said text recognition.

In a particular embodiment, the method comprises, after said normalizing, displaying the text lines of said at least one text block in the structured handwriting format.

In a particular embodiment, the model data of the at least one text block comprises:
character information defining a plurality of characters, each character being associated with at least one stroke of digital ink and with a text line of the at least one text block;
word information defining a plurality of words, each word being associated with at least one character as defined by the character information; and
line information defining each text line of the at least one text block, each text line being associated with at least one word as defined by the word information.
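The three kinds of information above can be pictured as a small linked data structure. The following is a minimal sketch only, assuming integer identifiers and a hypothetical `ModelData` class with a hypothetical `strokes_of_line` helper; none of these names come from the patent, which does not prescribe a storage layout.

```python
from dataclasses import dataclass, field


@dataclass
class ModelData:
    """Hypothetical container for character, word and line information."""
    # character id -> (ids of the strokes forming it, id of its text line)
    characters: dict = field(default_factory=dict)
    # word id -> ids of the characters forming it
    words: dict = field(default_factory=dict)
    # line id -> ids of the words on it
    lines: dict = field(default_factory=dict)

    def strokes_of_line(self, line_id):
        """Collect, in order, every stroke id reachable from a text line."""
        stroke_ids = []
        for word_id in self.lines[line_id]:
            for char_id in self.words[word_id]:
                strokes, _line = self.characters[char_id]
                stroke_ids.extend(strokes)
        return stroke_ids


# Example: the word "it" on line 0. Each letter is drawn with two strokes
# ("i": stem and dot, "t": stem and bar).
model = ModelData(
    characters={0: ([0, 1], 0), 1: ([2, 3], 0)},
    words={0: [0, 1]},
    lines={0: [0]},
)
print(model.strokes_of_line(0))
```

Keeping the stroke-to-character-to-word-to-line links explicit is what later allows a whole text line, and every stroke belonging to it, to be transformed as one unit.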

In a particular embodiment, the line information comprises, for each text line of said at least one text block:
origin coordinates representing an origin of the text line;
slant information representing a slant of the text line; and
height information representing a height of the text line.

In a particular embodiment, said updating of the model data during said normalizing comprises updating the line information of said text line based on the respective transform function.

In a particular embodiment, said normalizing comprises, for each text line, determining input parameters comprising the origin coordinates, the slant information and the height information of said text line, wherein the respective transform function is computed based on the input parameters and on the document pattern.

In a particular embodiment, the document pattern defines at least one of the following handwriting constraints to be complied with by handwritten text:
a margin of a display area; and
an interline distance.

In a particular embodiment, each transform function defines at least one of the following transformation components to be applied to the respective text line during said normalizing:
a translation component;
a scale component; and
a rotation component.
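One common way to represent a transform function built from translation, scale and rotation components is a 3x3 homogeneous matrix. The sketch below is an illustration under that assumption; the patent does not state how the transform function is encoded, and the helper names (`translation`, `scaling`, `rotation`, `compose`, `apply`) are hypothetical.

```python
import math


def translation(tx, ty):
    """3x3 matrix that shifts a point by (tx, ty)."""
    return [[1.0, 0.0, tx], [0.0, 1.0, ty], [0.0, 0.0, 1.0]]


def scaling(s):
    """3x3 matrix that scales uniformly by factor s about (0, 0)."""
    return [[s, 0.0, 0.0], [0.0, s, 0.0], [0.0, 0.0, 1.0]]


def rotation(angle):
    """3x3 matrix that rotates by `angle` radians about (0, 0)."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def compose(*matrices):
    """Multiply matrices left to right; the rightmost one is applied first."""
    result = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    for m in matrices:
        result = [
            [sum(result[i][k] * m[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)
        ]
    return result


def apply(matrix, point):
    """Apply a 3x3 homogeneous matrix to a 2D point."""
    x, y = point
    return (
        matrix[0][0] * x + matrix[0][1] * y + matrix[0][2],
        matrix[1][0] * x + matrix[1][1] * y + matrix[1][2],
    )


# Rotate by 90 degrees, then double the size, then shift by (10, 0).
tf = compose(translation(10.0, 0.0), scaling(2.0), rotation(math.pi / 2))
x, y = apply(tf, (1.0, 0.0))
```

A single composed matrix can then be applied to every point of every stroke in a text line, so the three components never have to be applied in separate passes.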

In a particular embodiment, the document pattern comprises said line pattern, which defines guidelines along which handwritten text is to be arranged in the structured handwriting format.

In a particular embodiment, the scale component of the transform function is determined during said normalizing based on a ratio of the distance between two consecutive guidelines of said line pattern to the height of the respective text line.
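As a worked illustration of this ratio, the sketch below takes the scale factor to be exactly the guideline distance divided by the line height. That is an assumption: the text only says the scale component is "based on" the ratio, and the function name `scale_component` is hypothetical.

```python
def scale_component(guideline_distance, line_height):
    """Scale factor that makes a text line as tall as the guideline spacing.

    Assumption: the factor equals the ratio itself; a real implementation
    might leave some headroom between consecutive lines.
    """
    if line_height <= 0:
        raise ValueError("line height must be positive")
    return guideline_distance / line_height


# A 25-unit-high handwritten line on a pattern with 40-unit line spacing.
factor = scale_component(guideline_distance=40.0, line_height=25.0)
```

A factor above 1 enlarges small handwriting to fill the line pattern, and a factor below 1 shrinks oversized handwriting.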

In a particular embodiment, the translation component of the transform function is determined during said normalizing so as to translate the text line such that the origin of said text line is moved into alignment with the corresponding guideline of the line pattern assigned to said text line during said normalizing.
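The translation can be sketched as the offset between the line origin and its target position on a guideline. The passage does not say how a guideline is assigned to a text line, so the sketch below assumes the vertically closest guideline is chosen and that the origin is also moved to the left margin; both choices, and the name `translation_component`, are illustrative assumptions.

```python
def translation_component(origin, guideline_ys, left_margin):
    """Return (dx, dy) moving `origin` onto its assigned guideline.

    Assumptions (not from the source): the assigned guideline is the one
    vertically closest to the origin, and the origin is also moved
    horizontally to the left margin of the document pattern.
    """
    ox, oy = origin
    target_y = min(guideline_ys, key=lambda gy: abs(gy - oy))
    return (left_margin - ox, target_y - oy)


dx, dy = translation_component(
    origin=(37.0, 112.0),
    guideline_ys=[40.0, 80.0, 120.0, 160.0],
    left_margin=20.0,
)
```

A nearest-guideline rule can map two text lines to the same guideline, so a fuller implementation would need some way to assign consecutive text lines to distinct guidelines.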

In a particular embodiment, said rotation component is determined during said normalizing so as to rotate the respective text line and thereby reduce its slant to zero in accordance with the document pattern.
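Reducing the slant to zero amounts to rotating the line by the opposite of its slant angle. The sketch below assumes the slant information is an angle in radians and that the rotation pivots on the line origin; the source specifies neither, and `rotate_to_horizontal` is a hypothetical name.

```python
import math


def rotate_to_horizontal(points, origin, slant):
    """Rotate `points` about `origin` by -slant so the line becomes level.

    Assumptions: `slant` is an angle in radians measured with atan2(dy, dx)
    in the same coordinate system as the points, and the pivot is the
    text line origin.
    """
    ox, oy = origin
    c, s = math.cos(-slant), math.sin(-slant)
    return [
        (ox + c * (x - ox) - s * (y - oy), oy + s * (x - ox) + c * (y - oy))
        for x, y in points
    ]


# A baseline running from (0, 0) to (10, 10) has a slant of 45 degrees.
baseline = [(0.0, 0.0), (10.0, 10.0)]
slant = math.atan2(10.0, 10.0)
flat = rotate_to_horizontal(baseline, origin=(0.0, 0.0), slant=slant)
```

After the rotation both baseline points share the same vertical coordinate, and the baseline keeps its original length because a rotation does not scale.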

In a particular embodiment, during said normalizing, the model data of each text line is updated in accordance with the respective transform function while any text recognition that could otherwise result from applying the respective transform function is blocked.
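The idea is that the new line information can be derived from the transform itself, so the recognition engine does not need to run again on the moved strokes. The sketch below illustrates this with hypothetical `LineInfo` and `Transform` classes; it assumes rotation and scaling pivot on the line origin, so that only the translation moves the origin. None of these names or conventions come from the source.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class LineInfo:
    origin: tuple   # (x, y) of the text line origin
    slant: float    # radians (assumed unit)
    height: float
    text: str       # result of the earlier text recognition


@dataclass(frozen=True)
class Transform:
    dx: float
    dy: float
    scale: float
    angle: float    # radians; rotation about the line origin (assumed)


def update_line_info(line, tf):
    """Derive updated line info from the transform alone.

    The recognized text is carried over unchanged: no recognition call is
    made here, which mirrors the blocking of re-recognition described above.
    """
    ox, oy = line.origin
    return replace(
        line,
        origin=(ox + tf.dx, oy + tf.dy),
        slant=line.slant + tf.angle,
        height=line.height * tf.scale,
    )


before = LineInfo(origin=(37.0, 112.0), slant=0.1, height=25.0, text="hello")
after = update_line_info(before, Transform(dx=-17.0, dy=8.0, scale=1.6, angle=-0.1))
```

Because the update is pure arithmetic on stored parameters, the recognized content stays stable across normalization even though every stroke has moved.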

According to another aspect, the present invention relates to a non-transitory computer-readable medium having recorded thereon computer-readable program code (or a computer program) including instructions for executing the steps of the method of the invention as defined in the present document.

The computer program of the invention can be expressed in any programming language, and can be in the form of source code, object code, or any intermediate code between source code and object code, such as a partially compiled form, or in any other appropriate form.

The invention also provides a computer program as mentioned above.

The non-transitory computer-readable medium mentioned above can be any entity or device capable of storing the computer program. For example, the recording medium can comprise storage means, such as a ROM memory (a CD-ROM or a ROM implemented in a microelectronic circuit), or magnetic storage means, such as a floppy disk or a hard disk.

The non-transitory computer-readable medium of the invention can correspond to a transmittable medium, such as an electrical or optical signal, which can be conveyed via an electric or optical cable, by radio, or by any other appropriate means. The computer program according to the present disclosure can in particular be downloaded from the Internet or a similar network.

Alternatively, the non-transitory computer-readable medium can correspond to an integrated circuit in which the computer program is loaded, the circuit being adapted to execute, or to be used in the execution of, the method of the invention.

In a particular embodiment, the invention relates to a non-transitory computer-readable medium having computer-readable program code embodied therein, said computer-readable program code being adapted to be executed to implement a method for handling handwritten input elements on a computing device as defined in the present document, the computing device comprising a processor for executing the steps of said method.

The present invention also relates to a computing device suitable for implementing the method as defined in the present disclosure. More particularly, the invention provides a computing device for handwritten text, comprising:
an input surface for detecting a plurality of strokes of digital ink, said strokes being input in a freehand writing format without any handwriting constraint;
a display device for displaying said plurality of input strokes in said freehand writing format;
a classifier for classifying each stroke as text or non-text, said classifier being configured to detect as text at least one text block of handwritten text from said input strokes handwritten in the freehand writing format;
a line extractor for extracting text lines of handwritten text from said at least one text block;
a recognition engine for performing text recognition on each text line of said at least one text block, thereby generating model data associating each stroke of said at least one text block with a character, a word and a text line of said at least one text block; and
a text editor for normalizing each text line of handwritten text from the freehand writing format into a structured handwriting format so as to comply with a line pattern of a document pattern, said text editor being configured to perform, for each text line:
  computing a respective transform function for said text line to transform said text line into the structured handwriting format;
  applying the respective transform function to transform each stroke of said text line into the structured handwriting format; and
  updating the model data of said text line based on the respective transform function.

The various embodiments defined above in connection with the method of the present invention apply in an analogous manner to the computing device, the computer program and the non-transitory computer-readable medium of the present disclosure.

For each step of the method of the present invention as defined in the present disclosure, the computing device may comprise a corresponding module configured to perform said step.

In a particular embodiment, the disclosure may be implemented using software and/or hardware components. In this context, the term "module" can refer in this disclosure to a software component, to a hardware component, or to a plurality of software and/or hardware components.

Other characteristics and advantages of the present disclosure will appear from the following detailed description, made with reference to the accompanying drawings, which show embodiments having no limiting character.

The drawings show, in order of appearance:
A and B (FIGS. 1A and 1B): a digital device according to a conventional arrangement.
A block diagram schematically showing a computing device according to a particular embodiment of the invention.
A schematic view of a normalization process according to a particular embodiment of the invention.
A block diagram schematically showing modules implemented by the computing device of FIG. 2, according to a particular embodiment of the invention.
A flow diagram schematically showing the steps of a method according to a particular embodiment of the invention.
Three schematic views of handwritten text while it is being processed by the computing device of FIG. 2, according to a particular embodiment of the invention.
A schematic view of model data generated by the computing device of FIG. 2, according to a particular embodiment of the invention.
A schematic view of handwritten text normalized by the computing device of FIG. 2, according to a particular embodiment of the invention.
A view of editing operations applied to handwritten text during a normalization process, according to a particular embodiment of the invention.
A schematic view of handwritten text being edited after normalization, according to a particular embodiment of the invention.

The components in the figures are not necessarily drawn to scale; emphasis is instead placed on illustrating the principles of the present invention.

For simplicity and clarity of description, the same reference numerals are used throughout the figures to refer to the same or similar parts, unless otherwise indicated.

In the following detailed description, numerous specific details are set forth by way of example in order to provide a thorough understanding of the relevant teachings. However, it will be apparent to those skilled in the art that the present teachings may be practiced without such details. In other instances, well-known methods, procedures and/or components are described at a relatively high level, without detail, in order to avoid unnecessarily obscuring aspects of the present teachings.

The following description of the exemplary embodiments refers to the accompanying drawings. The following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims. In the various embodiments illustrated in the figures, a computing device, a corresponding method and a corresponding computer program are discussed.

The term "text" in the present description is understood to encompass all characters (e.g., alphanumeric characters and the like), and strings thereof, in any written language, as well as any symbols used in written text.

The term "non-text" in the present description is understood to encompass free-form handwritten or hand-drawn content (e.g., shapes, drawings) and image data, as well as characters, and strings thereof, or symbols that are used in non-text contexts. Non-text content defines graphic or geometric formations in linear or non-linear configurations, including containers, drawings, common shapes (e.g., arrows, blocks) and the like. In diagrams, for instance, text content may be contained in a shape (a rectangle, an ellipse, an oval shape, etc.) called a container.

Furthermore, the examples shown in these drawings are in the context of a language written from left to right, and any reference to positions can therefore be adapted for written languages having different directional formats.

The various technologies described herein generally relate to the capture, processing and management of hand-drawn or handwritten content on portable and non-portable computing devices. The systems and methods described herein may utilize recognition of a user's natural writing and drawing styles input to a computing device via an input surface, such as a touch-sensitive screen (as discussed later). While the various embodiments are described with respect to recognition of digital ink handwriting input using so-called online recognition techniques, it is understood that they can be applied to other forms of input for recognition, such as offline recognition involving a remote device or server that performs the recognition.

The terms "hand-drawing" and "handwriting" are used interchangeably herein to define the creation of digital content (handwriting input) by users through the use of their hands (or fingers) or of an input device (a hand-held stylus or digital pen, a mouse, etc.) on or with an input surface. The term "hand" and the like are used herein to provide a concise description of the input techniques; however, the use of other parts of a user's body for similar input, such as the foot, mouth, and eye, is included in this definition.

As described in more detail below, an aspect of the present invention relies on detecting handwritten text in digital ink entered on a computing device in a freehand writing format; performing text recognition, which involves generating model data representing the handwritten text input; and normalizing each text line of the handwritten text input from the freehand writing format to a structured handwriting format so that it complies with a document pattern (e.g., with a line pattern of the document pattern). This normalization involves applying editing transformations (e.g., translation, rotation, and/or scaling) to the handwritten text input to convert it into the structured handwriting format. The model data is also updated in accordance with the editing transformations applied to the handwritten text input for the purpose of normalization. In other words, the transformation function applied to each text line of the handwritten text input during normalization is also used as the basis for updating the model data of that text line. As explained below in particular embodiments, the model data defines correlations between each stroke of the handwritten text input and a respective character, a respective word, and a respective text line of the handwritten text input.
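As an illustration of the editing transformations named above (translation, rotation, and scaling), the following Python sketch applies one combined transformation to the sampled points of a stroke. It is a minimal sketch under stated assumptions, not the implementation of the invention: the representation of a stroke as a list of (x, y) points, the names `make_transform` and `transform_stroke`, and the order of operations (scale, then rotate about the origin, then translate) are all hypothetical.

```python
import math

def make_transform(dx=0.0, dy=0.0, angle=0.0, scale=1.0):
    """Build a point transform: scale, then rotate about the origin, then translate.

    The parameter names and the order of operations are assumptions made
    for this illustration only.
    """
    cos_a, sin_a = math.cos(angle), math.sin(angle)

    def transform(point):
        x, y = point[0] * scale, point[1] * scale
        return (x * cos_a - y * sin_a + dx, x * sin_a + y * cos_a + dy)

    return transform

def transform_stroke(stroke, transform):
    """Apply the same transform to every sampled point of one stroke."""
    return [transform(p) for p in stroke]

# Hypothetical stroke sampled at three points.
stroke = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0)]

# Double the size, then shift by (10, 5).
moved = transform_stroke(stroke, make_transform(dx=10.0, dy=5.0, scale=2.0))

# Rotate the point (1, 0) by a quarter turn; the result is (0, 1) up to rounding.
rotated = make_transform(angle=math.pi / 2)((1.0, 0.0))
```

Because the transformation is an ordinary function of a point, the same function object could in principle be reused to update any geometric information kept in the model data, which is the property the paragraph above relies on.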

By updating the model data as part of the normalization process, handwritten text input in digital ink can be converted efficiently from an unconstrained environment (the freehand writing format) into a formatted environment (the structured handwriting format), which allows better display, more reliable text recognition, and a wider range of operations, such as editing of the handwritten text input. As explained further below, although the normalization process modifies the input ink, any previous text recognition state is preserved. This is possible because the handwritten text input and the model data are updated using the same transformation.

In the present invention, each text line that undergoes normalization is a text line of handwritten text. As described further below, several transformations are applied to these handwritten text lines as part of the normalization process, but once normalized, the text lines (i.e., the normalized text lines) are still handwritten text lines: handwriting formed by input strokes and arranged as text lines in a normalized manner (as opposed, for example, to typeset content, which does not constitute handwriting formed by input strokes). As described further below, the strokes of each text line of handwritten text are subjected to a transformation function that defines the transformation converting that handwritten text line into a normalized handwritten text line.

FIG. 2 shows a block diagram of a computing device 100 according to a particular embodiment of the present invention. The computing device (or digital device) 100 may be a desktop computer, laptop computer, tablet computer, e-book reader, mobile phone, smartphone, wearable computer, digital watch, interactive whiteboard, global positioning system (GPS) unit, enterprise digital assistant (EDA), personal digital assistant (PDA), game console, or the like. The computing device 100 includes at least one processing element, some form of memory, and input and output (I/O) devices. The components communicate with each other through inputs and outputs, such as connectors, lines, buses, links, networks, or others known to those skilled in the art.

More specifically, the computing device 100 comprises an input surface 104 for hand-drawing (or handwriting) input elements, including text and non-text elements, as described further below. In particular, the input surface 104 is suitable for detecting a plurality of input strokes of digital ink entered on said input surface. As also discussed further below, these input strokes may be input in a freehand writing format (or freehand writing mode), that is, without any handwriting constraint on position, size, and orientation in an input area.

The input surface 104 may employ a technology such as resistive, surface acoustic wave, capacitive, infrared grid, infrared acrylic projection, optical imaging, dispersive signal technology, acoustic pulse recognition, or any other appropriate technology known to those skilled in the art, to receive user input in the form of a touch-sensitive or proximity-sensitive surface. The input surface 104 may also be a non-touch-sensitive surface monitored by a position detection system.

The computing device 100 also comprises at least one display device (or display) 102 for outputting data from the computing device, such as images, text, and video. The display device 102 may be a screen or the like of any appropriate technology (LCD, plasma, etc.). As described further below, the display device 102 is suitable for displaying input elements in digital ink, each input element being formed of at least one stroke of digital ink. In particular, the display device 102 may display a plurality of strokes entered on the input surface 104, for instance in the freehand writing format mentioned above.

The input surface 104 may be co-located with the display device 102 or remotely connected to it. In a particular example, the display device 102 and the input surface 104 are parts of a touchscreen.

As shown in FIG. 2, the computing device 100 further comprises a processor 106 and a memory 108. The computing device 100 may also comprise one or more volatile storage elements (RAM), either as part of the memory 108 or separate from it.

The processor 106 is a hardware device for executing software, particularly software stored in the memory 108. The processor 106 can be any custom-made or commercially available general-purpose processor, a central processing unit (CPU), a semiconductor-based microprocessor (in the form of a microchip or chipset), a microcontroller, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, or any combination thereof, and more generally any appropriate processor component designed for executing software instructions, as known to those skilled in the art.

The memory 108 constitutes (or comprises) a non-transitory (or non-volatile) computer-readable medium (or recording medium) in accordance with a particular embodiment of the present disclosure. The memory 108 may include any combination of non-volatile storage elements (e.g., ROM, EPROM, flash PROM, EEPROM, hard drive, magnetic or optical tape, memory registers, CD-ROM, WORM, DVD, or the like).

The memory 108 may be remote from the computing device 100, for example at a server or cloud-based system that is remotely accessible by the computing device 100. The non-volatile memory 108 is coupled to the processor 106 so that the processor 106 can read information from, and write information to, the memory 108. As an alternative, the memory 108 is integral to the computing device 100.

The memory 108 includes an operating system (OS) 110 and a handwriting application (or computer program) 112. The operating system 110 controls the execution of the application 112. The application 112 constitutes (or comprises) a computer program (or computer-readable program code) according to a particular embodiment of the invention, this computer program comprising instructions to implement a method according to a particular embodiment of the invention.

The application 112 may include instructions for detecting and managing ink input elements handwritten by a user using the input surface 104 of the computing device 100. As discussed later, these handwritten ink input elements (also called handwriting input), which may be text or non-text, are formed by one or more strokes of digital ink.

The application 112 may comprise a handwriting recognition (HWR) module (or system) 114 for recognizing handwriting input to the computing device 100, including handwritten text and non-text. The HWR module 114 may be a source program, an executable program (object code), a script, an application, or any other component having a set of instructions to be performed. In the present example depicted in FIG. 2, the application 112 and the HWR module 114 are combined in a single application (the HWR module 114 is part of the application 112). Alternatively, the HWR module 114 may be a module, method, or system for communicating with a handwriting recognition system remote from the computing device 100, such as a server (or cloud-based system) SV1 as depicted in FIG. 2, which is remotely accessible by the computing device 100 through an appropriate communication link. The application 112 and the HWR module 114 may also be separate components stored in the memory 108 (or in different memories) of the computing device 100, whereby the application 112 and the HWR module 114 operate together, accessing information processed and stored in the memory 108.

As shown later in the figures, input strokes entered on or via the input surface 104 are processed by the processor 106 as digital ink. Digital ink is formed by rendering handwriting input in a digital image format, in this case on the display device 102.

A user may enter an input stroke with a hand or finger, or with some input instrument such as a digital pen or stylus suitable for use with the input surface 104. The user may also enter an input stroke by making a gesture above the input surface 104, if means configured to sense motion in the vicinity of the input surface 104 are being used, or with a peripheral device of the computing device 100, such as a mouse or a joystick.

Each ink input element (a letter, symbol, word, shape, etc.) is formed by one or more such input strokes, or at least by a portion of a stroke. A stroke (or input stroke) is characterized by at least a stroke initiation location (corresponding to a "pen down" event), a stroke termination location (corresponding to a "pen up" event), and the path connecting the stroke initiation and stroke termination locations. Because different users may naturally write or hand-draw the same object (e.g., a letter, shape, or symbol) with slight variations, the HWR module 114 accommodates a variety of ways in which each object may be entered while still being recognized as the correct or intended object.
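The stroke definition above (an initiation location at "pen down", a termination location at "pen up", and the path between them) can be pictured with a small data structure. The Python sketch below is illustrative only: the `Stroke` class, the `(kind, x, y)` event tuples, and the `strokes_from_events` helper are hypothetical names, not part of the described device or of any real pen API.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]

@dataclass
class Stroke:
    """One input stroke: the sampled path from pen down to pen up."""
    path: List[Point]

    def __post_init__(self):
        if not self.path:
            raise ValueError("a stroke needs at least one point")

    @property
    def start(self) -> Point:
        """Stroke initiation location (the 'pen down' point)."""
        return self.path[0]

    @property
    def end(self) -> Point:
        """Stroke termination location (the 'pen up' point)."""
        return self.path[-1]

def strokes_from_events(events):
    """Group hypothetical (kind, x, y) pen events into strokes.

    kind is 'down', 'move', or 'up'; events seen before a 'down' are ignored.
    """
    strokes, current = [], None
    for kind, x, y in events:
        if kind == "down":
            current = [(x, y)]
        elif current is not None:
            current.append((x, y))
            if kind == "up":
                strokes.append(Stroke(current))
                current = None
    return strokes

# A short line followed by a dot (pen down and pen up at the same place).
events = [
    ("down", 0.0, 0.0), ("move", 1.0, 1.0), ("up", 2.0, 0.0),
    ("down", 5.0, 5.0), ("up", 5.0, 5.0),
]
strokes = strokes_from_events(events)
```

In this sketch a dot is simply a stroke whose initiation and termination locations coincide, so it is still a complete stroke in its own right.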

The handwriting application 112 allows handwritten or hand-drawn content (e.g., text, diagrams, charts, shapes, drawings, or any kind of text and/or non-text handwriting input) to be generated in digital ink form and to be accurately recognized using the HWR module 114.

In the present example, the memory 108 is also suitable for storing model data DT that defines a model document representing handwritten text entered by a user on the input surface 104 of the computing device 100. The nature and use of this model data DT are discussed in more detail below.

As shown in FIG. 3, in the present embodiment, the computing device 100 is configured to detect handwriting input IN entered on the input surface 104 in a freehand writing format (or freehand writing mode) FT1, that is, without any handwriting constraint imposed on the user. The freehand writing mode allows the user to handwrite input elements IN in a free environment (e.g., in a blank zone Z1), in an unstructured or unguided fashion, i.e., without any handwriting constraint on the position, size, or orientation of the handwritten text input (no line pattern to follow, no limitation on size or orientation, no constraint on interline spacing, margins, and so on). The freehand writing mode FT1 gives the user complete freedom during handwriting input, which is sometimes desirable, for instance to take quick and miscellaneous notes or to enter a mix of text and non-text.

The handwriting input IN detected by the input surface 104 and displayed by the display device 102 is formed by a plurality of input strokes of digital ink.

As shown in FIG. 3, the computing device 100 may classify the input strokes forming the handwriting input IN as text, and thus recognize the handwriting input IN as a text block BL1.

The computing device 100 is configured to perform a normalization process on the recognized text block BL1, whereby the text block BL1 is converted from the freehand writing format FT1 into a structured handwriting format (or structured handwriting mode) FT2, also called the structured format (or structured mode), so as to comply with a document pattern 200 (e.g., with a line pattern of the document pattern 200). A document pattern is understood in this disclosure as defining geometric constraints on how handwritten text input IN is to be arranged and displayed on the display device 102. The document pattern 200 constitutes a structured (or formatted) environment for receiving handwriting input IN (in this case, the text block BL1) in a structured manner, i.e., in the structured handwriting format FT2.

Note that the document pattern 200 defines how handwriting is structurally arranged.

The document pattern may define at least one of the following handwriting constraints to be complied with by the text:
a margin of the display area,
a line pattern, and
a distance between lines.

In particular, the document pattern 200 may define a line pattern and, possibly, also at least one of a margin of the display area and a distance between lines.

In the present example shown in FIG. 3, the document pattern 200 comprises a line pattern defining a plurality of guidelines (also called guiding lines or baselines) 202, along which the handwritten text (i.e., the content of the text block BL1) is to be arranged in the structured handwriting format FT2. The document pattern 200 may define a predetermined interline distance d1 between each pair of consecutive guidelines 202, thereby imposing a size constraint on handwritten text in the structured handwriting format FT2. The document pattern 200 may also define a predetermined line length d2, which imposes a maximum length on each text line of the text block BL1 in the structured format FT2. It should be understood that this structured format FT2, based on the document pattern 200, is merely one example of implementation. Other document patterns may be contemplated by those skilled in the art.
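To make the roles of the interline distance d1 and the line length d2 concrete, the following Python sketch models a document pattern as a small immutable record. It is only a sketch under assumptions: the `DocumentPattern` class, its field names, the `top_offset` and `left_margin` fields, and the convention that y grows downward are hypothetical and are not taken from the described embodiment.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class DocumentPattern:
    """Hypothetical record of the geometric constraints of a document pattern."""
    interline_distance: float   # d1: distance between consecutive guidelines
    line_length: float          # d2: maximum length of a text line
    left_margin: float = 0.0    # assumption: x position where each line starts
    top_offset: float = 0.0     # assumption: y position of the first guideline

    def guideline_y(self, index: int) -> float:
        """Vertical position of the guideline with the given index (0-based)."""
        return self.top_offset + index * self.interline_distance

    def fits(self, width: float) -> bool:
        """True when a text line of the given width respects the line length d2."""
        return width <= self.line_length

# Example values chosen arbitrarily for illustration.
pattern = DocumentPattern(interline_distance=40.0, line_length=500.0, top_offset=60.0)
first = pattern.guideline_y(0)    # 60.0
third = pattern.guideline_y(2)    # 140.0
```

With such a record, the size constraint comes from d1 (a line must fit between two guidelines) and the length constraint from d2, which is what `fits` checks.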

As shown in FIG. 4 according to a particular embodiment, when running the application 112 stored in the memory 108 (FIG. 2), the processor 106 implements a number of processing modules, namely a classifier MD2, a line extractor MD4, a recognition engine MD6, and an editing module MD8. The application 112 comprises instructions configuring the processor to implement these modules in order to perform the steps of a method of the invention, as described later in particular embodiments.

The classifier MD2 is configured to classify each input stroke (or any set or combination of input strokes) detected by the input surface 104 as text or non-text. In particular, as described further below, the classifier MD2 may be configured to detect as text at least one text block BL1 of input strokes entered on the input surface 104 in the freehand writing format FT1.

To this end, the classifier (or classifying module) MD2 may perform a disambiguation process to distinguish text from non-text content in the digital ink entered by the user. The disambiguation process may be performed in any manner known to those skilled in the art. An exemplary implementation is described in US Patent Application Publication No. 2017/0109578 A1.

As an example of a disambiguation process, the classifier MD2 may group strokes using spatial and temporal considerations to build hypotheses about which strokes may belong to non-text or text elements. The spatial considerations may include the distance between strokes, the geometry of the strokes, the overlap of the strokes, the relative positions of the strokes, and so on. The temporal considerations may include the time order of stroke input. Probability scores may be calculated so that only hypotheses with a sufficiently high score (above a threshold) are retained. Features for each group of strokes are then extracted in consideration of shape and text language models; these features may include separation distances, changes of direction within strokes, overlap, direction of the stroke pattern, curvature, and so on. The strokes may then be classified by the classifier MD2 into text and non-text by testing the hypotheses based on all of the collected information, including the extracted features of the groups of strokes within those hypotheses and the spatial and temporal information of the strokes within those groups. As already mentioned, however, other techniques of non-text/text differentiation may be contemplated.
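Two of the ideas in this paragraph, grouping strokes by spatial and temporal proximity and retaining only hypotheses whose score exceeds a threshold, can be sketched in a few lines of Python. This is a deliberately simplified stand-in, not the disambiguation process of the cited publication: the grouping rule (distance from the end of one stroke to the start of the next, in input order), the function names, and the example scores are all assumptions.

```python
import math

def group_strokes(strokes, max_gap):
    """Group strokes, taken in input (temporal) order, by spatial proximity.

    A stroke joins the current group when its first point lies within
    max_gap of the last point of the previous stroke; otherwise it starts
    a new group. Each stroke is a list of (x, y) points.
    """
    groups = []
    for stroke in strokes:
        if groups and math.dist(groups[-1][-1][-1], stroke[0]) <= max_gap:
            groups[-1].append(stroke)
        else:
            groups.append([stroke])
    return groups

def keep_confident(hypotheses, threshold):
    """Keep only (label, score) hypotheses whose score is above the threshold."""
    return [(label, score) for label, score in hypotheses if score > threshold]

# Three hypothetical strokes: two close together, one far away.
strokes = [
    [(0.0, 0.0), (1.0, 0.0)],
    [(1.5, 0.0), (2.0, 1.0)],
    [(50.0, 50.0), (51.0, 51.0)],
]
groups = group_strokes(strokes, max_gap=5.0)

# Made-up scores; a real classifier would compute them from extracted features.
kept = keep_confident([("text", 0.9), ("non-text", 0.2)], threshold=0.5)
```

A fuller sketch would also use stroke geometry, overlap, and language-model features, as the paragraph describes; these are omitted here to keep the example short.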

In other examples, no text/non-text classification is needed because only text is received as handwriting input by the computing device 100, so that handwritten text is detected directly on the input surface 104.

The line extractor (or line extraction module) MD4 is configured to extract text lines LN from the text block BL1 of input strokes detected by the classifier MD2. In other words, the line extractor MD4 can identify the separate text lines LN that collectively form the text block BL1 (the text block BL1 is thus divided into multiple text lines LN). As described further below, this line extraction may be performed as part of a text recognition process. In a particular example, the line extractor MD4 and the recognition engine MD6 together form a single module.

The recognition engine (or recognition module) MD6 is configured to perform text recognition on each text line LN of the text block BL1, once extracted by the line extractor MD4. Text recognition may be performed using any suitable technique known to those skilled in the art, involving for instance generating a list of element candidates (or hypotheses) with probability scores and applying language models (lexicon, grammar, semantics, etc.) to the element candidates to find the best recognition result. Statistical information modeling how frequently a given sequence of elements appears in a specified language, or is used by a specific user, may also be taken into account to evaluate the likelihood of the interpretations provided by the recognition engine MD6. Further details are not provided, to avoid unnecessarily obscuring the present disclosure. An example of implementing handwriting recognition can be found, for instance, in US Patent Application Publication No. 2017/0109578 A1.

In the present embodiment, while performing text recognition, the recognition engine MD6 is configured to generate model data DT that associates each input stroke of the text block BL1 with a unique character, a unique word, and a unique text line of the text block BL1. As described further below, the model data DT defines correlations (or links, or references) between each stroke of the handwritten text input and a respective character, a respective word, and a respective text line of the handwritten text input.
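One way to picture the stroke-to-character, stroke-to-word, and stroke-to-line associations held in the model data DT is as a mapping from stroke identifiers to index triples. The Python sketch below is illustrative only: the `StrokeRef` record, the stroke identifiers `s1` to `s6`, the example word, and both helper functions are hypothetical, and the actual structure of the model data DT may differ.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class StrokeRef:
    """Hypothetical link from one stroke to its text line, word, and character."""
    line: int
    word: int
    char: int

# Made-up example: the word "Hi" on line 0 and one more stroke on line 1.
model_data = {
    "s1": StrokeRef(line=0, word=0, char=0),  # first stroke of 'H'
    "s2": StrokeRef(line=0, word=0, char=0),  # second stroke of 'H'
    "s3": StrokeRef(line=0, word=0, char=0),  # third stroke of 'H'
    "s4": StrokeRef(line=0, word=0, char=1),  # stem of 'i'
    "s5": StrokeRef(line=0, word=0, char=1),  # dot of 'i'
    "s6": StrokeRef(line=1, word=0, char=0),  # a stroke on the next line
}

def strokes_by_line(data):
    """Invert the mapping: text line index -> identifiers of its strokes."""
    by_line = defaultdict(list)
    for stroke_id, ref in data.items():
        by_line[ref.line].append(stroke_id)
    return dict(by_line)

def strokes_of_char(data, line, word, char):
    """Identifiers of the strokes that form one character."""
    target = StrokeRef(line, word, char)
    return [stroke_id for stroke_id, ref in data.items() if ref == target]

lines = strokes_by_line(model_data)
dot_and_stem = strokes_of_char(model_data, 0, 0, 1)
```

Keeping the links per stroke means that a per-line operation, such as normalization, can find every stroke it needs to transform from the line index alone.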

The editing module (or text editor) MD8 is configured to normalize each text line LN from the freehand writing format FT1 into the structured format FT2 so as to comply with the document pattern 200 (as shown in FIG. 3). As described further below, during this normalization the editing module MD8 may be configured to perform, for each text line LN:
computing a respective transformation function for said text line LN, to transform said text line LN into the structured format FT2 so that it complies with the document pattern 200 (e.g., with a line pattern of the document pattern 200);
applying the respective transformation function to transform each input stroke of said text line LN into the structured format FT2; and
updating the model data DT (more specifically, the model data DT for said text line LN) based on the respective transformation function.
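The three operations listed above (compute a transformation function for a text line, apply it to each stroke, and update the model data from the same function) can be sketched as follows. This Python sketch makes several assumptions that are not in the description: the transformation is limited to uniform scaling plus translation (rotation is omitted), y grows downward, the baseline of a line is taken as the bottom of its bounding box, and the model data of a line is a dictionary holding a `box` entry. All function and key names are hypothetical.

```python
def bounding_box(strokes):
    """Bounding box (x_min, y_min, x_max, y_max) of a list of strokes."""
    xs = [x for s in strokes for x, _ in s]
    ys = [y for s in strokes for _, y in s]
    return (min(xs), min(ys), max(xs), max(ys))

def compute_transform(box, target_x, target_baseline, target_height):
    """Step 1: compute (scale, dx, dy) mapping the line onto a guideline.

    Assumption: the bottom-left corner of the box is sent to
    (target_x, target_baseline) and the height is scaled to target_height.
    """
    x_min, y_min, x_max, y_max = box
    height = y_max - y_min
    scale = target_height / height if height else 1.0
    return (scale, target_x - scale * x_min, target_baseline - scale * y_max)

def apply_transform(t, point):
    """Apply a (scale, dx, dy) transform to one (x, y) point."""
    scale, dx, dy = t
    return (scale * point[0] + dx, scale * point[1] + dy)

def normalize_line(strokes, line_model, target_x, target_baseline, target_height):
    """Steps 2 and 3: transform the strokes, then update the model data
    of the line with the very same transform."""
    t = compute_transform(line_model["box"], target_x, target_baseline, target_height)
    new_strokes = [[apply_transform(t, p) for p in s] for s in strokes]
    x_min, y_min, x_max, y_max = line_model["box"]
    new_box = apply_transform(t, (x_min, y_min)) + apply_transform(t, (x_max, y_max))
    return new_strokes, dict(line_model, box=new_box)

# Hypothetical text line made of two strokes.
strokes = [[(10.0, 20.0), (30.0, 60.0)], [(20.0, 40.0), (50.0, 40.0)]]
line_model = {"stroke_ids": ["a", "b"], "box": bounding_box(strokes)}
new_strokes, new_model = normalize_line(strokes, line_model, 0.0, 100.0, 20.0)
```

In this sketch the updated `box` in the model data matches the bounding box of the transformed strokes without being recomputed from them, and the stroke identifiers are untouched, which mirrors the idea that ink and model data stay consistent because both are updated with the same transformation.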

In other words, when normalizing the text block BL1, the editing module MD8 converts the text block BL1 from the freehand writing format FT1 into the structured handwriting format FT2 on a line-by-line basis. Once normalized by the editing module MD8, the content is still handwriting in nature (formed by input strokes), even though some transformations have been applied to the strokes forming the handwriting as part of the normalization process.

Once normalization has been performed, the display device 102 may be configured to display the text block BL1 in the structured format FT2. The editing module MD8 may also be configured to perform further editing on the text block BL1 in the structured format FT2.

The configuration and operation of the modules MD2 to MD8 of the computing device 100 will become more apparent in the particular embodiments described below with reference to the figures. It should be understood that the modules MD2 to MD8 as shown in FIG. 4 represent only an exemplary embodiment of the present invention, and that other implementations are possible.

For each step of the method of the present invention, the computing device may comprise a corresponding module configured to perform said step. At least two of these steps may be performed as part of a single module.

A method performed by the computing device 100 illustrated in FIGS. 2 to 4 will now be described with reference to FIGS. 5 to 12, according to a particular embodiment of the invention. More specifically, the computing device 100 implements this method by executing the application 112 stored in the memory 108.

Consider an exemplary scenario in which a user enters a handwritten input IN on the computing device 100, as shown in FIG. 6. The computing device 100 then performs processing on the handwritten input IN, including a normalization process as described below.

More specifically, in a detection step S2, the computing device 100 detects the handwritten input IN entered by the user on the input surface 104 of the computing device 100. As shown in FIG. 6, the handwritten input IN comprises a plurality of input strokes ST of digital ink formed by the user on the input surface 104. For example, the first string of characters "This" is formed by the input strokes ST1, ST2, ST3, and ST4. As already mentioned, each input stroke ST is characterized at least by a stroke start position, a stroke end position, and a path connecting the stroke start position and the stroke end position. Thus, for example, the dot placed on top of the character "i" (in the word "This") constitutes by itself a single stroke ST4.
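
By way of illustration only, an input stroke ST characterized as above might be represented as follows. This is a minimal sketch: the class name `Stroke`, its fields, and the coordinate values are hypothetical and are not taken from the described embodiment.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Stroke:
    """One input stroke ST: an ordered path of sampled points (assumed layout)."""
    stroke_id: str
    path: List[Point]  # the first point is the start position, the last is the end

    @property
    def start(self) -> Point:
        return self.path[0]

    @property
    def end(self) -> Point:
        return self.path[-1]


# A stroke with a start position, an end position, and a path connecting them.
st1 = Stroke("ST1", [(10.0, 12.0), (14.0, 12.5), (18.0, 12.0)])

# The dot over the "i" in "This" is a stroke of its own (ST4 in the text);
# here it is a single sampled point, so its start and end coincide.
st4 = Stroke("ST4", [(42.0, 10.0)])
```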

In this example, the handwritten digital ink IN is entered in the freehand writing format FT1 as previously described (FIG. 3), that is, without any handwriting constraint within a given input area of the display 102. With no constraint of line, size, orientation, or the like to comply with, the user can handwrite the content IN freely and easily. As can be seen, the size, orientation, and position of each handwritten character or word may vary arbitrarily depending on the user's preferences. Although the handwritten input IN is shown in FIG. 6 along substantially parallel lines, it may instead be arranged along lines of varying orientation.

As shown in FIG. 7, the computing device 100 displays (S4) the plurality of input strokes ST of the handwritten input IN on the display device 102 according to the freehand writing format (or mode) FT1.

In a classification step S6, the computing device 100 classifies each input stroke ST detected in the freehand writing format FT1 as either text or non-text. To this end, the classifier MD2 of the computing device 100 may perform the previously described disambiguation process in any suitable manner.

For simplicity, it is assumed in this example that the entire handwritten input IN detected by the computing device 100 is text (i.e., handwritten text). Thus, during the classification step S6, the computing device 100 detects as text a text block BL1 of handwritten text formed by the input strokes ST entered in the freehand writing format FT1. Also for simplicity, a single text block BL1 of handwritten text is detected by the computing device 100 in this example; note, however, that the concept of the invention applies in the same manner to a plurality of text blocks of handwritten text detected by the computing device 100 in the handwritten input IN.

As shown in FIG. 8, the computing device 100 then performs text recognition (S8) on the text block BL1. During the text recognition S8, the computing device 100 analyzes various features of the input strokes ST in order to recognize predefined recognizable text elements, such as characters CH (or symbols) and words WD (FIG. 8), according to any suitable language. As already mentioned, the text recognition may be performed using any suitable technique known to those skilled in the art, including, for example, generating a list of element candidates (or hypotheses) with probability scores and applying a language model (lexicon, grammar, semantics, etc.) to the element candidates in order to select the most suitable candidate. To this end, the recognition engine MD6 may parse and analyze the content of each text line LN using any suitable linguistic information in order to identify predefined characters.
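
The candidate-selection idea mentioned above (a list of hypotheses with probability scores, re-ranked by a language model) can be illustrated with a deliberately simplified sketch. The function `pick_candidate`, the lexicon bonus, and the candidate scores are all assumptions made for illustration; the source does not specify how the recognition engine MD6 combines these scores.

```python
def pick_candidate(candidates, lexicon, lexicon_bonus=2.0):
    """Select the most suitable hypothesis.

    candidates: list of (text, probability) pairs from a hypothetical recognizer.
    lexicon: set of known words, standing in for a language model.
    lexicon_bonus: made-up factor that favors hypotheses found in the lexicon.
    """
    def score(item):
        text, probability = item
        return probability * (lexicon_bonus if text in lexicon else 1.0)

    return max(candidates, key=score)[0]


# The raw shape score slightly prefers "Tnis", but the lexicon favors "This".
candidates = [("Tnis", 0.55), ("This", 0.45)]
best = pick_candidate(candidates, lexicon={"This", "example", "shows"})
```

A real language model would typically also weigh grammar and context across neighboring words rather than a single lexicon lookup.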

The text recognition S8 comprises two steps described below, namely a line extraction step S10 and a generation step S12.

More specifically, during the text recognition S8, the computing device 100 performs line extraction S10 to extract the text lines LN of handwritten text from the text block BL1. In this example, the computing device 100 divides the text block BL1 into five separate text lines LN1 to LN5. The line extraction may be performed by the line extractor MD4 based on a geometric analysis of the strokes ST (or sets of strokes ST) of the text block BL1 detected during the text/non-text classification S6. Based on this analysis, the line extractor MD4 is able to determine which text line each input stroke ST belongs to. Those skilled in the art may implement any suitable technique that allows text lines to be identified within a text block entered in freehand writing mode.

Still during the text recognition S8, the computing device 100 also generates (S12) model data DT that associate each stroke ST of the text block BL1 with a specific character CH, a specific word WD, and a specific text line LN of the text block BL1.

The model data DT define a document model (also called an interactive text model) of the text block BL1. Each text line LN is structured according to this document model. As shown later, it is by establishing these correlations that ink interactivity can subsequently be achieved when editing the text block BL1. For example, erasing a word WD in the text block BL1 causes the computing device 100 to also erase all the constituent characters CH referenced by this word WD according to the model data DT.

In this example, the model data DT comprise three categories: character information, word information, and line information. The model data, which may be organized in any suitable manner (e.g., as a tree of relations or a tree of references), define cross-references that link each stroke ST (or each stroke portion) to a specific character CH, each character CH to a specific word WD, and each word WD to a specific text line LN, thereby establishing correlations between the strokes ST (or stroke portions), the characters CH, the words WD, and the text lines LN detected in the text recognition S8. As a result, the digital ink is structured and associated with the model data DT to form interactive ink.

More specifically, as shown in FIG. 9, according to a particular embodiment, the model data DT of the text block BL1 may comprise:
character information 220 defining a plurality of characters CH, each character CH being associated with at least one stroke ST (or at least a portion of a stroke ST) of digital ink and with a text line LN of the text block BL1;
word information 222 defining a plurality of words WD, each word WD being associated with at least one character CH defined by the character information 220; and
line information 224 defining each text line LN of the text block BL1, each text line LN being associated with at least one word WD (or part of a word) defined by the word information 222.

Note that a character CH may be formed by a single input stroke ST, by a plurality of input strokes ST, or by a portion of one or more input strokes ST. The character information 220 indicates the link between each stroke portion and a specific character CH. For example, as shown in FIG. 6, the characters CH2, CH3, and CH4 recognized by the computing device 100 during the text recognition S10 are formed, partially or entirely, by the portions ST31, ST32, and ST33 of the stroke ST3, respectively.

In a particular example, the line information 224 comprises, for each text line LN of the text block BL1:
origin coordinates (x, y) indicating an origin of the text line LN;
slope information (a) indicating a slope of the text line LN; and
height information (h) indicating a height of the text line LN.

The line information 224 thus allows the computing device 100 to define the position, orientation, and size of each text line LN.

Note that any predetermined reference point belonging to a text line LN may be defined in the line information 224 as the above-mentioned "origin" of said text line LN.

The origin coordinates of a text line LN may be defined by a pair of floating-point values: Line (x, y). The slope information may be a floating-point value: Line (a). The height information may be a floating-point value, Line (h), indicating for example the average height of the characters assigned to this text line according to the document model. In this example, the height (h) is expressed perpendicularly to the slope (a) for each given text line LN.

The computing device 100 may calculate the average height (h) of each text line LN so as to take into account the ascenders and descenders of the alphabet, in order to avoid introducing noise into the resulting average value for each text line.
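
The source does not specify how ascenders and descenders are accounted for. As one possible illustration only, the sketch below substitutes a median for the plain average, since a median is less affected by the few tall characters on a line. The function name `line_height` and the sample heights are hypothetical.

```python
from statistics import mean, median


def line_height(char_heights):
    """Estimate the height h of a text line from per-character heights.

    Assumption: a median is used here as a simple stand-in for an average
    that discounts ascenders and descenders; it is not the source's method.
    """
    return median(char_heights)


# Four characters of ordinary height plus two with an ascender or descender.
heights = [10.0, 11.0, 10.0, 19.0, 10.0, 18.0]

plain_average = mean(heights)      # pulled upward by the two tall characters
robust_height = line_height(heights)
```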

As an example, the computing device 100 recognizes (S10) that the input strokes ST1 to ST4 in the first text line LN1 collectively form the word WD1 in a particular structured manner. As a result, as shown in FIGS. 6 and 9, the computing device 100 generates (S12) model data DT comprising character information 220 that defines the following characters, associated with the text line LN1:
a character CH1 associated with the strokes ST1 and ST2;
a character CH2 associated with the portion ST31 of the stroke ST3;
a character CH3 associated with the portion ST32 of the stroke ST3 (complementary to the portions ST31 and ST33) and with the stroke ST4; and
a character CH4 associated with the portion ST33 of the stroke ST3 (complementary to the portions ST31 and ST32).

The other characters CH in the text block BL1 are defined in a similar manner in the character information 220.

Still in this example, the word information 222 defines the word WD1 associated with the characters CH1, CH2, CH3, and CH4. The other words WD in the text block BL1 are defined in a similar manner in the word information 222.

Still in this example, the line information 224 defines lines such as:
the text line LN1, with origin coordinates (x1, y1), slope information (a1), and height information (h1); and
the text line LN2, with origin coordinates (x2, y2), slope information (a2), and height information (h2).

The other text lines LN3, LN4, and LN5 of the text block BL1 are defined in a similar manner in the line information 224.
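
The cross-references described for the word "This" can be sketched as plain lookup tables. This is only one possible organization, simplified so that each line lists its words rather than each character also pointing at its line. The names `LineInfo`, `strokes_of_word`, and `line_of_stroke` and all numeric values are hypothetical; the identifiers ST1 to ST4, ST31 to ST33, CH1 to CH4, WD1, and LN1 are those of the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class LineInfo:
    """Line information 224 for one text line LN (assumed layout)."""
    origin: Tuple[float, float]  # origin coordinates (x, y)
    slope: float                 # slope a, assumed here to be in radians
    height: float                # height h
    words: List[str]             # identifiers of the words WD on this line


# Character information 220: character -> strokes or stroke portions.
characters: Dict[str, List[str]] = {
    "CH1": ["ST1", "ST2"],
    "CH2": ["ST31"],
    "CH3": ["ST32", "ST4"],
    "CH4": ["ST33"],
}
# Word information 222: word -> characters.
words: Dict[str, List[str]] = {"WD1": ["CH1", "CH2", "CH3", "CH4"]}
# Line information 224: line -> geometry and words (numeric values are made up).
lines: Dict[str, LineInfo] = {
    "LN1": LineInfo(origin=(12.0, 30.0), slope=0.05, height=18.0, words=["WD1"]),
}


def strokes_of_word(word_id: str) -> List[str]:
    """Follow the cross-references word -> characters -> strokes."""
    return [s for char_id in words[word_id] for s in characters[char_id]]


def line_of_stroke(stroke_id: str) -> str:
    """Follow the cross-references stroke -> character -> word -> line."""
    for line_id, info in lines.items():
        for word_id in info.words:
            if stroke_id in strokes_of_word(word_id):
                return line_id
    raise KeyError(stroke_id)
```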

It should be understood, however, that the model data DT may be organized in a manner different from that shown in FIG. 9.

In a storing step S14, the computing device 100 stores the model data DT generated in S12 in the memory 108.

As shown in FIG. 10, in a normalization step S16, the computing device 100 normalizes each handwritten text line LN of the text block BL1 from the freehand writing format FT1 into the structured handwriting format FT2, as described above. This normalization is performed on a line-by-line basis, as described below in particular examples. It allows the computing device 100 to convert the text block BL1 of handwritten text from an unconstrained environment (freehand writing mode FT1) into a formatted or structured environment (structured handwriting format FT2), so that the handwritten text can be arranged, and later displayed, in a more organized (i.e., normalized) and efficient manner according to the document pattern 200.

In the example of FIG. 10, the result of the normalization S16 is displayed by the computing device 100 on the display device 102. As can be seen, the text lines LN1 to LN5 of handwritten text are normalized into a uniform arrangement of position, size, and orientation according to the structured format FT2 defined by the document pattern 200. For example, the text lines LN2 and LN4 are downsized to comply with the line-to-line distance d1 imposed by the document pattern 200 (more specifically, by the line pattern). All the text lines LN of handwritten text are aligned and positioned according to the guidelines 202. As already indicated, various other forms of normalization may be contemplated within the present invention.

This normalization S16 is achieved by transforming (S20) each text line LN of the text block BL1 so that the text lines are rearranged according to the structured handwriting format FT2. As described further below, these transformations are performed by applying a respective transformation function TF to each text line LN of handwritten text (i.e., to the strokes ST of each text line LN).

As already indicated, each text line LN undergoing normalization is a text line of handwritten text. Although some transformations are applied to these handwritten text lines as part of the normalization process S16, the text lines LN, once normalized (i.e., the normalized text lines), are still handwritten text lines, that is, handwriting arranged as text lines but in a normalized manner (as opposed, for example, to typeset content, which does not constitute handwriting formed by input strokes). As described further below, the strokes ST of each text line LN of handwritten text are subjected to a transformation function TF defining a transformation that turns the corresponding handwritten text line into a normalized handwritten text line.

In addition, the model data DT representing the handwritten text lines LN are updated (S22) by the computing device 100 based on the same transformation (defined by the transformation function) that is applied to the text lines LN during the normalization S16.

More specifically, during the normalization step S16 (FIG. 5), the computing device 100 performs the same iteration, comprising steps S18, S20, and S22, for each text line LN of the text block BL1. For clarity, this iteration S18 to S22 is described in detail below for the first text line LN1 only. The steps S18 to S22 may be applied in a similar manner to each text line LN of the text block BL1.

During a determination step S18, the computing device 100 calculates (or determines) a respective transformation function TF to transform the text line LN1 of handwritten text into the structured handwriting format FT2 so as to comply with the document pattern 200. In a particular example, the computing device 100 obtains, in S18, input parameters comprising the line information 224 of the text line LN1, i.e., the origin coordinates (x1, y1), the slope information (a1), and the height information (h1) of the text line LN1 of handwritten text. The computing device 100 then determines (S18) the transformation function TF for the text line LN1 based on the obtained input parameters and on the document pattern 200 (e.g., taking into account the line pattern of the document pattern 200 applicable in the structured format FT2). Specifically, the computing device 100 may determine the transformation function TF such that the handwritten text of the text line LN1 is moved to a first baseline 202 of the document pattern 200, oriented according to this first baseline, and scaled according to the line spacing of the document pattern 200. For example, the origin of the text line LN1 is moved to a predetermined position on the first baseline 202.

As shown in FIG. 11, each transformation function TF calculated by the computing device 100 for a respective text line LN may define at least one of the following transformation components to be applied, during the normalization step S16, to said text line LN of handwritten text (and thus to the strokes ST forming this text line LN):
a translation component CP1;
a scaling component CP2; and
a rotation component CP3.

In the present case, each transformation function TF is considered to comprise these three components CP1, CP2, and CP3, although other implementations are possible.

The translation component CP1 may define a translation of the respective text line LN. The scaling component CP2 may define a scaling operation on the respective text line LN. Finally, the rotation component CP3 may define a rotation of the respective text line LN.

In a particular example, during the normalization step S16, the translation component CP1 of the transformation function TF is determined so as to translate the text line (e.g., LN1) such that the origin of this text line is moved into alignment with the corresponding baseline 202 assigned to said text line during the normalization step S16.

In a particular example, during the normalization step S16, the scaling component CP2 of the transformation function TF is determined, for the respective text line (e.g., LN1), based on the ratio of the distance d1 between two consecutive baselines 202 of the line pattern of the document pattern 200 to the height (e.g., h1) of the text line.

In a particular example, during the normalization step S16, the rotation component CP3 is determined so as to rotate the respective text line (e.g., LN1) in order to reduce its slope (e.g., a1), taken for example with respect to the baselines 202 of the document pattern 200, to zero in accordance with the document pattern 200.

As an example, the transformation function TF may define a rotation to be applied to a text line LN during normalization. The rotation applied to a slanted line LN may be a rotation by the angle opposite to the slope (a). For a given slope (a) of the text line LN, the rotation component CP3 of the transformation function TF may thus be a rotation of (-a). If the transformation function TF includes a rotation, it may cause each point of the affected stroke(s) or stroke portions to rotate about the origin of the text line LN. As explained further below, the line information 224 may then be updated accordingly (by setting the slope (a) of the line to zero: a = 0).

Note that rotating a text line LN is not always necessary to convert it into the structured format FT2, namely when the text line LN already has the appropriate orientation (e.g., when the text line LN was handwritten in the freehand writing format FT1 substantially along the direction of the respective baseline 202 of the document pattern 200). In such a case, the rotation component CP3 may be set such that CP3 = 0.

Similarly, in some cases, the components CP1 and/or CP2 may be set to zero if no translation and/or scaling, respectively, is needed to convert the respective text line LN into the structured format FT2.

In a particular example, the computing device 100 determines, in S18, the transformation function TF for the text line LN1 such that at least one of the components CP1, CP2, and CP3 is different from zero. In the example shown in FIG. 11, the transformation function TF is calculated such that the text line LN1, which reads "This example shows" in handwriting, is moved (translated) on the display device 102 according to the translation component CP1, scaled (downsized) according to the scaling component CP2, and rotated by a certain angle according to the rotation component CP3.
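
The determination step S18 can be sketched as follows, under stated assumptions: the slope a is an angle in radians, the scaling component is taken to be exactly the ratio d1/h (the source only says it is "based on" this ratio), and the target position on the baseline is supplied by the caller. The names `Transform` and `compute_transform` are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Transform:
    """Transformation function TF for one text line (assumed representation)."""
    translation: Tuple[float, float]  # CP1: offset that moves the line origin
    scale: float                      # CP2: uniform scaling factor
    rotation: float                   # CP3: rotation angle in radians


def compute_transform(origin, slope, height, target_origin, line_spacing):
    """Derive TF from the line information (x, y, a, h) and the line pattern.

    target_origin: position on the assigned baseline 202 (chosen by the caller).
    line_spacing: distance d1 between two consecutive baselines 202.
    """
    dx = target_origin[0] - origin[0]
    dy = target_origin[1] - origin[1]
    return Transform(
        translation=(dx, dy),
        scale=line_spacing / height,  # assumption: CP2 equals d1 / h exactly
        rotation=-slope,              # rotation opposite to the slope a
    )


# Made-up values: a line twice as tall as the line spacing, slightly slanted.
tf = compute_transform(
    origin=(10.0, 50.0), slope=0.1, height=40.0,
    target_origin=(0.0, 100.0), line_spacing=20.0,
)
```

An implementation may well multiply the ratio by a margin factor so that ascenders and descenders of adjacent lines do not touch; that choice is left open here.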

In a transformation step S20, the computing device 100 applies the respective transformation function TF so as to transform each stroke ST of the text line LN1 into the structured handwriting format FT2. In this example, the computing device 100 thus moves (translates) the text line LN1 according to the translation component CP1, scales the text line LN1 according to the scaling component CP2, and rotates the text line LN1 according to the rotation component CP3, as already described above with reference to FIG. 11. Once normalized as shown in FIG. 11, the phrase "This example shows" still constitutes handwriting (formed by strokes), but it is now organized in a structured, normalized manner.
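
One way the transformation step S20 might act on the points of a stroke is sketched below. It assumes that rotation and scaling are both anchored at the origin of the text line and that the translation is applied last; the source states that rotation is about the line origin and that the origin may serve as an anchor when scaling, but the order of composition is an assumption. The function name `apply_transform` is hypothetical.

```python
import math


def apply_transform(points, origin, translation, scale, rotation):
    """Apply rotation (CP3), scaling (CP2), then translation (CP1) to points.

    points: list of (x, y) stroke points of one text line.
    origin: origin of the text line, used as the fixed point for CP3 and CP2.
    rotation: angle in radians, counterclockwise in a y-up coordinate system.
    """
    ox, oy = origin
    cos_r, sin_r = math.cos(rotation), math.sin(rotation)
    transformed = []
    for px, py in points:
        rx, ry = px - ox, py - oy                    # relative to the line origin
        qx = scale * (rx * cos_r - ry * sin_r)       # rotate, then scale
        qy = scale * (rx * sin_r + ry * cos_r)
        transformed.append((ox + qx + translation[0], oy + qy + translation[1]))
    return transformed


# A point on a line slanted by 45 degrees, straightened, halved, and moved.
result = apply_transform(
    points=[(0.0, 0.0), (1.0, 1.0)],
    origin=(0.0, 0.0),
    translation=(5.0, 7.0),
    scale=0.5,
    rotation=-math.pi / 4,
)
```

On a display whose y axis points downward, the sign of the rotation angle would need to be adapted.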

In an update step S22, the computing device 100 also updates the model data DT of the text block BL1 based on the respective transformation function TF determined in S18 for the text line LN1. More specifically, the computing device 100 updates the part of the model data DT relating to the text line LN1 according to the respective transformation function TF calculated in S18.

In this example, during the update step S22, the computing device 100 updates the line information 224 of the model data DT relating to the text line LN1 based on the respective transformation function TF determined in S18. As a result, the origin coordinates (x1, y1), the slope information (a1), and the height information (h1) are updated according to the components CP1, CP3, and CP2, respectively.
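
The update of the line information 224 can be sketched as below, assuming the line information is held in a dictionary and the slope is an angle in radians. The function name `update_line_info` and the values are hypothetical.

```python
def update_line_info(info, translation, scale, rotation):
    """Return line information updated by the components of TF.

    info: dict with 'origin' (x, y), 'slope' a, and 'height' h.
    The origin follows CP1, the slope follows CP3, and the height follows CP2.
    """
    x, y = info["origin"]
    return {
        "origin": (x + translation[0], y + translation[1]),
        "slope": info["slope"] + rotation,
        "height": info["height"] * scale,
    }


# Made-up line information for LN1 and a transformation that straightens it.
before = {"origin": (10.0, 50.0), "slope": 0.1, "height": 40.0}
after = update_line_info(before, translation=(-10.0, 50.0), scale=0.5, rotation=-0.1)
```

Because only these few values change, the links between strokes, characters, words, and lines stay valid without running text recognition again.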

As indicated above, the steps S18, S20, and S22 are performed in a similar manner for each text line LN detected in the line extraction step S10. As a result, the text block BL1 undergoes normalization (S16) such that it is converted into the structured format FT2 (S20) and its associated model data are updated accordingly (S22). This normalization is performed on a line-by-line basis in the sense that each text line LN is normalized based on a dedicated transformation function TF, and the model data relating to each text line LN are updated based on this dedicated transformation function TF.
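
The line-by-line iteration S18 to S22 might be organized as in the following sketch. It makes several assumptions not stated in the source: the i-th text line is assigned to the i-th baseline, every line origin is moved to a common left margin, the scale is exactly d1/h, and each line is held as a dictionary. The function name `normalize_block` and its parameters are hypothetical.

```python
import math


def normalize_block(lines, first_baseline_y, line_spacing, left_margin=0.0):
    """Normalize each text line with its own transformation.

    lines: list of dicts with 'origin', 'slope', 'height', and 'strokes'
           (each stroke being a list of (x, y) points).
    """
    normalized = []
    for index, line in enumerate(lines):
        # S18: determine the transformation dedicated to this line.
        target_x = left_margin
        target_y = first_baseline_y + index * line_spacing
        scale = line_spacing / line["height"]
        rotation = -line["slope"]
        ox, oy = line["origin"]
        cos_r, sin_r = math.cos(rotation), math.sin(rotation)

        # S20: transform every point of every stroke of this line.
        strokes = [
            [
                (
                    target_x + scale * ((px - ox) * cos_r - (py - oy) * sin_r),
                    target_y + scale * ((px - ox) * sin_r + (py - oy) * cos_r),
                )
                for px, py in stroke
            ]
            for stroke in line["strokes"]
        ]

        # S22: update the line information consistently with the transformation.
        normalized.append({
            "origin": (target_x, target_y),
            "slope": 0.0,
            "height": line["height"] * scale,
            "strokes": strokes,
        })
    return normalized


block = [
    {"origin": (5.0, 12.0), "slope": 0.0, "height": 40.0,
     "strokes": [[(5.0, 12.0), (45.0, 12.0)]]},
    {"origin": (8.0, 70.0), "slope": 0.0, "height": 10.0,
     "strokes": [[(8.0, 70.0), (18.0, 70.0)]]},
]
result = normalize_block(block, first_baseline_y=100.0, line_spacing=20.0)
```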

The update step S22 may be performed before or after the transformation step S20. More generally, the order in which the sequences of steps S18 to S22 are performed for the different text lines LN may be adapted by those skilled in the art.

As shown in FIG. 5, the computing device may store (S24) the updated model data DT in the memory 108 (FIG. 2). This storing step may be performed progressively while the update S22 is performed for each text line LN of the text block BL1. In this example, the updated model data DT of the text block BL1 are stored in S24 in place of the original model data DT stored in S14.

In addition, the computing device 100 may display (S26) the normalized text block BL1, as shown in FIG. 10. Once normalized, the text block BL1 is arranged according to the structured format FT2 so as to comply with (match) the document pattern 200, as described above.

The baseline 202 of each text line LN may be drawn by the computing device 100 using the origin and slope defined in the corresponding line information 224. In addition, the height value of each text line LN may be indicated using a highlighted rectangle (or the like) on the display device 102. The origin of each text line LN may be used as an anchor point when the text line is scaled during normalization.

The normalization S16 of the text of the text block BL1 from the freehand writing format into a structured context or format (such as FT2) improves the overall experience provided by ink interactivity. Normalization not only allows handwritten text input to be arranged and displayed in a more uniform and structured manner, it also improves the reliability of the text recognition system and the efficiency of editing, as described below.

The present invention allows digital ink to be structured with model data so as to form interactive ink. As indicated earlier, the model data DT defines cross-references that link each stroke ST to a specific character CH, each character CH to a specific word WD, and each word WD to a specific text line LN, thereby enabling ink interactivity. For example, erasing a word in a text block also involves erasing all the characters referenced by this word according to the document model. As another example, erasing all the content of a given line LN not only erases this line from the model data, but also causes the computing device 100 to erase all the words WD and characters CH that constitute this line. It is these correlations defined by the model data DT that make the line-by-line normalization process described above possible.
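The cross-references and the cascading erase behaviour described above can be sketched as follows. This is only an illustration under stated assumptions: the class `ModelData`, its dictionary layout and the identifier strings are hypothetical and do not come from the disclosure, which does not specify how the model data DT is stored.

```python
from dataclasses import dataclass, field


@dataclass
class ModelData:
    """Hypothetical cross-reference tables: stroke -> character -> word -> line."""
    stroke_to_char: dict[str, str] = field(default_factory=dict)
    char_to_word: dict[str, str] = field(default_factory=dict)
    word_to_line: dict[str, str] = field(default_factory=dict)

    def erase_word(self, word_id: str) -> set[str]:
        """Erase a word together with its characters; return the erased strokes."""
        chars = {c for c, w in self.char_to_word.items() if w == word_id}
        strokes = {s for s, c in self.stroke_to_char.items() if c in chars}
        for s in strokes:
            del self.stroke_to_char[s]
        for c in chars:
            del self.char_to_word[c]
        self.word_to_line.pop(word_id, None)
        return strokes

    def erase_line(self, line_id: str) -> set[str]:
        """Erase a line by erasing every word that references it."""
        words = [w for w, l in self.word_to_line.items() if l == line_id]
        erased: set[str] = set()
        for w in words:
            erased |= self.erase_word(w)
        return erased
```

The returned stroke identifiers tell the caller which ink to remove from the display, so the erase operation keeps the rendered ink and the model data consistent.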

Thanks to the invention, handwritten text input can be normalized while the associated model data representing the input handwritten text is kept up to date, so that no further text recognition needs to be performed in response to the transformations carried out during the normalization process. In other words, as part of the normalization process, the computing device 100 updates the correlations, as defined in the model data, between the strokes ST, characters CH, words WD and text lines LN of the handwritten text input IN, so that it is unnecessary to perform text recognition again on the normalized text block BL1. Without such an update, the normalized text block BL1 might be inconsistent with the result of the initial text recognition performed at S8.

Generally speaking, text recognition engines in known computing devices are typically configured to monitor any ink change or edit (for example, the erasure or editing of a stroke) that occurs on handwritten text. Consequently, if text normalization were performed on such a known computing device, it would cause the text recognition engine to run again in an attempt to recognize possible new content in the handwritten text input once converted into the structured format. In other words, normalizing digital ink in known systems leads to discarding the previous recognition result and reprocessing the entire ink to obtain a new recognition result. Such a reactive text recognition mechanism may be useful when the ink change results from some form of user editing, for example in the freehand writing mode, but it is in fact counterproductive for the computing device 100 to perform a new text recognition in response to the normalization S16 of the text block BL1 from the freehand writing format FT1 to the structured format FT2. Any further text recognition in response to the normalization S16 risks corrupting or contradicting the initial text recognition S8 performed on the handwritten text input IN in the freehand writing mode FT1. Moreover, performing such a further text recognition process requires time and resources. Updating the model data DT according to the transformation function TF applied to the digital ink during normalization keeps the digital ink and its associated data in a stable and consistent state while saving time and resources.

Instead of triggering a new text recognition in response to the normalization S16 performed on the text block BL1, the computing device 100 of the present invention is configured to update the model data DT of the text block BL1 according to the transformation function TF applied to each text line LN during normalization.

The point of the normalization process of the present invention is to manipulate the digital ink with respect to a given initial recognition result (i.e., the result of the initial text recognition S8). To this end, the computing device 100 may convert the digital ink of the text block BL1 into the structured format FT2 and update the document model accordingly, so that the result of the initial text recognition S8 obtained in the freehand writing mode is not called into question.

In a particular embodiment, during the normalization step S16, the model data DT of each text line LN is updated according to the respective transformation function TF obtained at S18, while any text recognition that would result from the application of the respective transformation function TF is blocked. As a result, the computing device 100 suppresses any new text recognition in response to the normalization performed on the text block BL1 in step S16. The recognized text thus remains stable even though the digital ink is transformed during normalization, which improves the reliability and efficiency of the recognition system as a whole.

It should be understood, however, that once the text block BL1 has been normalized into the structured format FT2 (as shown in FIG. 10), a new text recognition may be triggered to reflect any subsequent form of editing or ink change later performed on the normalized text block BL1. For example, if the user decides to edit the first text line LN1 in the structured mode FT2 (for example, by erasing or modifying a stroke, or part of a stroke, of a word WD or character CH), the computing device 100 may perform a new text recognition so as to recognize text that may differ from the result of the initial text recognition S8.

Accordingly, the computing device 100 preferably triggers a new text recognition in response to an editing operation performed after the normalization step S16. In such a case, the text recognition function is not permanently blocked in the computing device 100; instead, it remains active once the normalization S16 is complete, so that new content can be recognized if the text block BL1 is later edited in the structured format FT2. After normalization, handwritten text may thus be processed more reliably and easily by the computing device 100 for further manipulation and interaction, such as any form of editing (for example, text reflow).
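The behaviour described above, where recognition is blocked only while normalization runs and is available again for later edits, can be sketched with a context manager. This is a minimal illustration under assumptions: the class `RecognitionEngine`, its `on_ink_changed` hook and its `suppressed` method are hypothetical names, not part of the disclosure or of any real recognition library.

```python
from contextlib import contextmanager


class RecognitionEngine:
    """Hypothetical engine that re-runs recognition whenever the ink changes."""

    def __init__(self) -> None:
        self._suppressed = False
        self.runs = 0  # number of recognition passes actually performed

    def on_ink_changed(self) -> bool:
        """Called on every ink change; returns True if recognition ran."""
        if self._suppressed:
            return False
        self.runs += 1
        return True

    @contextmanager
    def suppressed(self):
        """Temporarily block recognition, e.g. for the duration of normalization."""
        previous = self._suppressed
        self._suppressed = True
        try:
            yield
        finally:
            # Restore the earlier state so later user edits trigger recognition.
            self._suppressed = previous
```

A caller would wrap the normalization in `with engine.suppressed():`, so that ink changes caused by the transformation functions are ignored, whereas an edit made afterwards in the structured format triggers recognition as usual.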

Furthermore, the present invention allows a user to input handwritten text in a freehand writing mode. In essence, no line, size or orientation constraints (including guidelines, margins and the like) are imposed on the user in the freehand writing mode, so that various complex forms of handwriting may be input. As described earlier, normalizing the handwritten text input into a structured handwriting format facilitates the performance of editing functions (modification, scaling, etc.) on the handwritten text by the computing device 100. Normalization may result, for example, from a user-initiated command received by the computing device, or from any predetermined event detected by the computing device. Even when the freehand writing mode is used initially, the user may edit and manipulate the handwriting input in a more structured and improved manner, thereby improving the user experience.

For example, as shown in FIG. 12, once the normalization S16 is complete, the computing device 100 may perform an editing operation during an editing step S28. In this example, a horizontal text reflow is performed on the normalized text block BL1 (a vertical text reflow is also possible). As a result, the relative positions of the different digital ink strokes of the text block BL1 in the structured format FT2 are rearranged. Such an editing operation may be triggered by the user in response to any suitable user command (for example, a user gesture on the input surface 104), or by the computing device 100 itself upon detection of a predetermined event.
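The disclosure does not specify how the horizontal reflow is computed. Purely as an illustration, the following Python sketch assumes a greedy strategy in which words, represented only by their widths after normalization, are packed onto successive lines of a fixed width; the function `reflow` and its parameters are hypothetical.

```python
def reflow(word_widths: list[float], line_width: float, gap: float) -> list[list[int]]:
    """Greedily assign word indices to lines so each line fits in `line_width`.

    `gap` is the horizontal spacing inserted between consecutive words.
    A word wider than `line_width` is placed alone on its own line.
    """
    lines: list[list[int]] = []
    current: list[int] = []
    used = 0.0
    for index, width in enumerate(word_widths):
        needed = width if not current else used + gap + width
        if current and needed > line_width:
            # The word does not fit: close the current line and start a new one.
            lines.append(current)
            current, needed = [], width
        current.append(index)
        used = needed
    if current:
        lines.append(current)
    return lines
```

Since the model data links each stroke to its word, moving a word to another line amounts to translating all of its strokes together and updating the word-to-line reference, without any new recognition of unchanged ink.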

It should be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, depending on the functionality involved, two blocks shown in succession may in fact be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order or in an alternative order.

Although the present invention has been described in particular embodiments, it is evident that it is susceptible to many modifications and embodiments within the ability of those skilled in the art and within the scope of the appended claims. In particular, those skilled in the art may contemplate any and all combinations and variations of the various embodiments described in this document that fall within the scope of the appended claims.

Claims

1. A method implemented by a computing device (100) for processing handwritten text, comprising:
detecting (S2) a plurality of input strokes (ST) of digital ink (IN) on an input surface (104), said input strokes being input in a freehand writing format (FT1) without any handwriting constraints;
displaying (S4) the plurality of input strokes (ST) on a display device (102) in the freehand writing format;
classifying (S6) each input stroke as text or non-text, said classifying comprising detecting, as text, at least one text block (BL1) of handwritten text from said input strokes handwritten in said freehand writing format (FT1);
performing (S8) text recognition on said at least one text block (BL1), said text recognition comprising:
extracting (S10) text lines of handwritten text from said at least one text block; and
generating (S12) model data (DT) associating each stroke (ST) of said at least one text block with a character (CH), a word (WD) and a text line (LN) of said at least one text block; and
normalizing (S16) each text line (LN) of the handwritten text from said freehand writing format to a structured handwriting format (FT2) so as to follow a line pattern of a document pattern (200), wherein the normalizing comprises, for each text line:
calculating (S18) a respective transformation function (TF) for said text line so as to transform said text line (LN) into said structured handwriting format (FT2);
applying (S20) the respective transformation function (TF) so as to transform each stroke of said text line into said structured handwriting format (FT2); and
updating (S22) the model data (DT) of said text line based on the respective transformation function.

2. The method of claim 1, comprising storing the model data (DT) generated during the text recognition,
wherein said updating the model data further comprises storing (S24) the updated model data of the at least one text block in place of the model data generated during the text recognition.

3. The method of claim 1 or claim 2, comprising displaying (S26) the text lines of the at least one text block in the structured handwriting format (FT2) after the normalizing.

4. The method of any one of claims 1-3, wherein said model data (DT) of said at least one text block (BL1) comprises:
character information (220) defining a plurality of characters, each character being associated with at least one stroke of digital ink and with a text line of said at least one text block;
word information (222) defining a plurality of words, each word being associated with at least one character defined by said character information; and
line information (224) defining each text line of said at least one text block, each text line being associated with at least one word defined by said word information.

5. The method of claim 4, wherein the line information comprises, for each text line of the at least one text block:
origin coordinates indicating the origin of the text line;
slant information indicating the slant of the text line; and
height information indicating the height of the text line.

6. The method of claim 4 or claim 5, wherein said updating of said model data (DT) during said normalization comprises updating said line information (224) of said text lines based on the respective transformation function.

7. The method of any one of claims 1-6, wherein, for each text line, the normalizing comprises determining input parameters including the origin coordinates, the slant information and the height information of the text line,
wherein each said transformation function is calculated based on said input parameters and said document pattern.

8. The method of any one of claims 1 to 7, wherein the document pattern defines at least one of the following handwriting constraints to be followed by handwritten text:
the margins of a display area; and
the distance between lines.

9. The method of any one of claims 1 to 8, wherein each transformation function defines at least one of the following transformation components to be applied to the respective text line during said normalization:
a translation component (CP1);
a scaling component (CP2); and
a rotation component (CP3).

10. The method of claim 9, wherein the document pattern comprises the line pattern (202) defining guidelines according to which handwritten text is to be laid out in the structured handwriting format (FT2).

11. The method of claim 10, wherein the scaling component (CP2) of the transformation function is determined during the normalization based on the ratio of the distance between two consecutive guidelines of the line pattern to the height of the respective text line.

12. The method of claim 10 or 11, wherein the translation component of the transformation function is determined during the normalization so as to perform a translation of the text line such that the origin of the text line is moved into alignment with a corresponding guideline of the line pattern assigned to the text line during the normalization.

The rotation component (CP3) is determined during the normalization to rotate each text line, thereby reducing the slope of each text line to zero according to the document pattern (200). A method according to any one of claims 10 to 9.

14. The method of any one of claims 1-13, wherein, during the normalization, the model data (DT) of each text line is updated according to the respective transformation function while any text recognition that may result from applying the respective transformation function is blocked.

15. A computing device for processing handwritten text, comprising:
an input surface (104) for detecting a plurality of input strokes (ST) of digital ink (IN), said strokes being input in a freehand writing format (FT1) without any handwriting constraints;
a display device (102) for displaying the plurality of input strokes in the freehand writing format;
a classifier (MD2) for classifying each stroke as text or non-text, said classifier being configured to detect, as text, at least one text block of handwritten text from said input strokes handwritten in said freehand writing format (FT1);
a line extractor (MD4) for extracting text lines of handwritten text from said at least one text block;
a recognition engine (MD6) for performing text recognition on each text line of the at least one text block and for generating model data associating each stroke of the at least one text block with a character, a word and a text line of the at least one text block; and
a text editor (MD8) for normalizing each text line of handwritten text from said freehand writing format to a structured handwriting format (FT2) so as to follow a line pattern of a document pattern, said text editor being configured, for each text line, to:
calculate a respective transformation function for the text line so as to transform the text line into the structured handwriting format;
apply the respective transformation function so as to transform each stroke of the text line into the structured handwriting format; and
update the model data of the text line based on the respective transformation function.